September 4, 2025
LSA Workflow Migration
This project migrated the Large-Scale Automation (LSA) pipeline from Make.com to n8n, prioritizing architectural control and long-term cost efficiency. The workflow starts by retrieving URLs from Google Sheets, then branches into two extraction paths: Apify handles complex web scraping, while a dedicated PDF extractor pulls the text out of documents. Because n8n can be self-hosted and is more flexible than the previous setup, the workflow can handle high-volume document processing with a more granular, scalable structure than its predecessor.
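The branching step described above can be sketched in a few lines of Python. This is only an illustration, not the actual n8n configuration: the function names `choose_extractor` and `route_urls` are hypothetical, and the rule "a path ending in .pdf goes to the PDF extractor, everything else goes to the web scraper" is an assumption about how the two paths are told apart.

```python
from urllib.parse import urlparse


def choose_extractor(url: str) -> str:
    """Return "pdf" when the URL path ends in .pdf, otherwise "web".

    Assumption: the file extension is enough to tell documents from pages.
    """
    path = urlparse(url).path.lower()
    return "pdf" if path.endswith(".pdf") else "web"


def route_urls(urls: list[str]) -> dict[str, list[str]]:
    """Split sheet rows into a web-scraping batch and a PDF batch."""
    routes: dict[str, list[str]] = {"web": [], "pdf": []}
    for raw in urls:
        url = raw.strip()
        if not url:
            continue  # skip empty sheet cells
        routes[choose_extractor(url)].append(url)
    return routes


if __name__ == "__main__":
    sample = [
        "https://example.com/article",
        "https://example.com/files/report.PDF?download=1",
        "   ",
    ]
    print(route_urls(sample))
```

Checking the parsed path rather than the raw string keeps query parameters such as `?download=1` from hiding the extension. A production workflow might also need to inspect the response's content type, since not every PDF link ends in `.pdf`.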
The core of the system takes a hybrid AI approach, using both Gemini and Grok to generate two summaries of the extracted content. To keep the data searchable and actionable as well as stored, the workflow writes to two stores: MongoDB is the primary repository for the structured summaries, while Pinecone holds vector embeddings generated via Hugging Face. Combining open-source flexibility with LLM integration yields a robust automation pipeline that reduces operational overhead while maintaining accuracy in text synthesis.
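As a rough sketch of how one processed URL might be laid out across the two stores, the Python below builds a summary document and a matching vector record. The field names, the `make_doc_id` helper, and the choice of a hashed URL as a shared key are all assumptions made for illustration, as is storing one summary per model. The source does not describe the actual schema. The vector record follows the id / values / metadata shape that Pinecone upserts accept, and no database or API call is made here.

```python
import hashlib


def make_doc_id(url: str) -> str:
    """Derive a stable ID from the URL (hypothetical keying scheme)."""
    return hashlib.sha256(url.encode("utf-8")).hexdigest()[:16]


def build_summary_document(url: str, gemini_summary: str, grok_summary: str) -> dict:
    """Shape of a record destined for MongoDB (assumed field names)."""
    return {
        "_id": make_doc_id(url),
        "url": url,
        "summaries": {"gemini": gemini_summary, "grok": grok_summary},
    }


def build_vector_record(url: str, embedding: list[float]) -> dict:
    """Shape of a record destined for Pinecone: id, values, metadata."""
    return {
        "id": make_doc_id(url),
        "values": [float(x) for x in embedding],
        "metadata": {"url": url},
    }


if __name__ == "__main__":
    url = "https://example.com/article"
    print(build_summary_document(url, "Summary A", "Summary B"))
    # Placeholder numbers; a real embedding would come from a Hugging Face model.
    print(build_vector_record(url, [0.1, 0.2, 0.3]))
```

Sharing one ID between the two records means a vector match found in Pinecone can be resolved to its full summaries in MongoDB with a single lookup by `_id`.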