n8nflow.net logo

Turn Any PDF into a Clean Google Doc with Mistral OCR

by HunyaoUpdated: Last update 2 months agoSource: n8n.io
Loading workflow viewer...

Getting Started

Upload a PDF and instantly get a neatly formatted Google Doc with all the readable text—no manual copy-paste, no messy line breaks.

What this workflow does

  • Accepts PDF uploads via a public form
  • Sends the file to Mistral Cloud for high-accuracy OCR
  • Detects and merges page images with their extracted text
  • Cleans headers, footers, broken lines, and noise
  • Creates a new Google Doc in your chosen Drive folder
  • Writes the polished markdown text into the document

What you need

  • Mistral Cloud API key with OCR access
  • Google Docs & Drive credentials connected in n8n
  • Drive folder ID for new documents
  • A PDF file to process (up to 100 MB)

Setup

  1. Import the workflow into n8n and activate credentials.
  2. In Trigger • Form Submission , copy the webhook URL and share it or embed it.
  3. In Create • Google Doc , replace the default folder ID with yours.
  4. Fill out Mistral API key under Mistral Cloud API credentials.
  5. Save and activate the workflow.
  6. Visit the form, upload a PDF, name your future doc, and submit.
  7. Open Drive to view your newly generated, clean Google Doc.

Example use cases

  • Convert annual reports into editable text for analysis.
  • Extract readable content from scan-only invoices for bookkeeping.
  • Turn magazine PDFs into draft blog posts.
  • Digitize lecture handouts for quick search and annotation.
  • Convert image-heavy landing pages / advertorials into editable text for AI to analyze structure and content.