JUHE API Marketplace

n8n Webpage Content Simplification Workflow

Active

For platform n8n, this automated workflow processes manual chat messages to fetch and simplify webpage content using HTTP requests. It extracts the main HTML body, removes unnecessary tags, and converts the content to Markdown format. By offering full or simplified output options, it ensures users receive concise information while managing lengthy page content effectively, enhancing clarity and usability.

Workflow Overview

For platform n8n, this automated workflow processes manual chat messages to fetch and simplify webpage content using HTTP requests. It extracts the main HTML body, removes unnecessary tags, and converts the content to Markdown format. By offering full or simplified output options, it ensures users receive concise information while managing lengthy page content effectively, enhancing clarity and usability.

This workflow is ideal for:

  • Developers looking to integrate web content fetching into their applications.
  • Data Analysts who need to gather and process web data efficiently.
  • Content Creators who want to extract and simplify webpage content for easier consumption.
  • Businesses that require automated data collection from web sources for analysis or reporting.

This workflow addresses the challenge of automatically fetching and processing webpage content. It allows users to:

  • Retrieve HTML content from a specified URL.
  • Handle errors gracefully, providing informative messages.
  • Simplify the content by removing unnecessary tags and links, making it easier to read or convert to Markdown format.
  • Limit the length of the output to avoid excessive data processing.
  1. Manual Trigger: The workflow starts when a user sends a manual chat message.
  2. OpenAI Chat Model: It utilizes OpenAI's GPT-4 model to process user input and prepare for web requests.
  3. Extract Query Parameters: The workflow extracts necessary query parameters from the user's input.
  4. HTTP Request: It makes an HTTP request to the specified URL, allowing for both full and simplified content fetching.
  5. Error Handling: If an error occurs during the HTTP request, the workflow captures it and prepares an appropriate error message.
  6. Extract HTML Body: The HTML body content is extracted from the response.
  7. Content Cleanup: Unwanted tags (like <script>, <style>, etc.) are removed from the content.
  8. Simplification Check: If the user requested simplified content, further modifications are made to remove URLs and image links.
  9. Convert to Markdown: The cleaned HTML is converted to Markdown format for easier readability.
  10. Send Page Content: Finally, the processed content is sent back to the user, ensuring it does not exceed the specified length limit.

Statistics

20
Nodes
0
Downloads
24
Views
10312
File Size

Quick Info

Categories
Complex Workflow
Manual Triggered
+1
Complexity
complex

Tags

manual
advanced
api
integration
logic
conditional
complex
sticky note
+3 more

Boost your workflows with Wisdom Gate LLM API

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.