klotz: scraping* + llm*

0 bookmark(s) - Sort by: Date ↓ / Title / - Bookmarks from other users for this tag

  1. The author records a screen capture of their Gmail account and uses Google Gemini to extract numeric values from the video.
  2. Cloudflare plans to launch a marketplace where website owners can sell AI model providers access to scrape their content. This move aims to give publishers more control over their content and monetization opportunities in the AI era.
    2024-09-23 Tags: , , , , by klotz
  3. Parsera is a simple and fast Python library for scraping websites using Large Language Models (LLMs). It's designed to be lightweight and minimize token usage for speed and cost efficiency.
    2024-08-16 Tags: , , , , , by klotz
  4. Reworkd is a platform that simplifies web data extraction, using LLM code generation to help businesses scale their web data pipelines. No coding skills required.
  5. Mariya Mansurova explores using CrewAI's multi-agent framework to create a solution for writing documentation based on tables and answering related questions.
    2024-06-25 Tags: , , , , , , by klotz
  6. AI Helps Make Web Scraping Faster And Easier: Scrapegraph-ai is a new tool that uses large language models (LLMs) to automate the process of web scraping and data processing.
    2024-05-10 Tags: , , by klotz
  7. Scrapegraph-ai is a Python library for web scraping using AI. It provides a SmartScraper class that allows users to extract information from websites using a prompt. The library uses LLM models like Ollama, OpenAI, Azure, Gemini, and others for information extraction.
    2024-05-03 Tags: , , , , by klotz
  8. AutoCrawler is a two-stage framework that leverages the hierarchical structure of HTML for progressive understanding and aims to assist crawlers in handling diverse and changing web environments more efficiently. This work introduces a crawler generation task for vertical information web pages and proposes the paradigm of combining LLMs with crawlers, which supports the adaptability of traditional methods and enhances the performance of generative agents in open-world scenarios. Generative agents, empowered by large language models, suffer from poor performance and reusability in open-world scenarios.
    2024-04-28 Tags: , , , , by klotz
  9. train models for processing documents based on specific needs and requirements. It offers capabilities such as entity recognition, key information extraction, and data validation,
    2024-01-12 Tags: , , , , by klotz
  10. 2023-10-27 Tags: , , , , , by klotz

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: Tags: scraping + llm

About - Propulsed by SemanticScuttle