Simplifies the retrieval, extraction, and training of structured data from various unstructured sources.
-
Updated
Oct 22, 2025 - Python
Simplifies the retrieval, extraction, and training of structured data from various unstructured sources.
Structured data extraction from research literature
A ruby gem to extract structured data from Google Local Search Results using the serpapi/bert-base-local-results model, enabling parsing, classification, and information extraction from English HTML content.
find a template of many similar html files
Convert any document format into LLM-ready data format (markdown) with advanced intelligent document processing capabilities powered by pre-trained models.
A new package that leverages language models to transform structured YAML data into well-formatted resume PDFs. Users provide their resume details in YAML format, and the package extracts key informat
A new package facilitates extracting a concise, structured summary from user-provided news headlines or brief texts by utilizing pattern matching and LLM interactions. This tool aims to help researche
This new package facilitates extracting structured insights from text-based content related to domain-specific issues, such as analyzing DNS blocking reports. Given unstructured text describing networ
A new package is designed to analyze financial news headlines and extract key structured information such as company names, financial targets, timeframes, and goal updates from text inputs. It simplif
A new package designed to facilitate structured extraction of key information from scientific or factual text inputs, enabling precise summaries, data extraction, or categorization based on user promp
A new package that analyzes technical arguments and extracts structured summaries from text discussions about infrastructure-as-code practices. It takes user-provided text (such as forum posts, articl
A Python-based tool for extracting structured data from PDFs using OCR and regex, and exporting it to CSV. Ideal for processing invoices, logs, or scanned documents into organized, usable datasets.
AI-powered assistant for analyzing Engineering Change Orders (ECOs) using Google Gemini and RAG
AI web data extraction
metadata extraction utility
powerautomate chatgpt smart email parsing
ai powered structured extraction
website extraction structured data
schema.org structured data scraper
JL data parsing automation
Add a description, image, and links to the structured-data-extraction topic page so that developers can more easily learn about it.
To associate your repository with the structured-data-extraction topic, visit your repo's landing page and select "manage topics."