n8nflow.net logo

Website Content Scraper & SEO Keyword Extractor with GPT-4o-mini and Airtable

by Abhishek PatoliyaUpdated: Last update 3 months agoSource: n8n.io
Loading workflow viewer...

Getting Started

This workflow allows you to scrape website content, clean the HTML, extract structured information using GPT-4o-mini, and store the results along with SEO keywords into Airtable. Ideal for building keyword lists and organizing web content for SEO research.


Setup Instructions

1. Prerequisites

  • n8n Community or Cloud instance
  • Airtable account with a base and table ready
  • OpenAI API Key with access to GPT-4o-mini

2. Airtable Structure

Ensure your Airtable table has the following fields:

Field NameTypeNotes
Website NameStringName or URL of the website
DataStringCleaned website text
KeywordStringExtracted SEO keyword list
StatusOptionsValues: Todo, In progress, Done

3. Node Setup

Form Trigger:
Collects website URL from the user.

HTTP Request:
Fetches the website content.

HTML Cleaner (Code Node):
Strips out styles, tags, and whitespace to get clean text.

Topic Extractor (AI Agent + GPT-4o-mini):
Extracts topic-wise information from the cleaned website content.

Text Cleaner (Code Node):
Removes unwanted symbols like ### and **.

Keyword Extractor (AI Agent + GPT-4o-mini):
Generates a list of 90 important SEO keywords.

Airtable Upsert:
Stores the cleaned data, keywords, and status in Airtable.


4. Key Features

✅ Automatic website content scraping
✅ Clean HTML and extract plain text
✅ Use GPT-4o-mini for topic-wise information extraction
✅ Generate 90-keyword SEO lists
✅ Store and manage data in Airtable


5. Use Cases

  • SEO Keyword Research
  • Competitor Website Content Analysis
  • Structured Website Data Collection

Additional Workflow Recommendations

✅ Rename Nodes for Clarity

Current NameSuggested Name
Website NameWebsite URL Input Form
HTTP RequestFetch Website Content
CodeHTML to Plain Text Cleaner
Split Out1Clean Text Splitter
AI Agent1Topic Extractor (GPT-4o-mini)
Code1Text Cleanup Formatter
Split Out2Final Text Splitter
AI AgentKeyword Extractor (GPT-4o-mini)
AirtableAirtable Data Upsert
Wait1Delay Before Merge
MergeCombine Data for Airtable