Automatically finds PDF file in Google Drive and extracts information. Use it to pull out clean output. It then formats the output into a clean JSON object.
1. Manual Trigger starts the process.
2. πFind File: "Google Drive" node finds the PDF file/files in a specified folder and downloads it/them.
3. πExtract Raw Text: "Extract From File" node pulls the text content from the retrieval file/files.
4. β
Output Clean Data: "Code" node refines the extracted content and runs custom code for cleaning and final formatting.
Setup time: < 5 minutes
Add Credentials in n8n:
Credentials
> New
to add them if you haven't created yet.Configure the Search Node (Get PDF Files/File
):
File/Folder
. In "Search Method" field, select "Search File/Folder Name", In "Search Query" type in *.pdf
.Define Extraction Rules (Extract Files/File's Data
):
Clean & Format Data (Optional):
Get PDF Data Only
node to keep only the fields you need and give them friendly names.Data Parser & Cleaner
node if you need to perform custom transformation.Activate and Run:
Thatβs it! Once configured, this workflow becomes your personal data assistant. Run it anytime you need to extract information quickly and accurately, saving you hours of manual work and ensuring your data is always ready to use.