Build software better, together

tfmorris / pdf2table

PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz

information-extraction table-extraction pdf-analysis

Updated Mar 15, 2024
Java

SreejaBethu / Smart-Report-Analyzer

Star

An AI-powered LLM app to analyze and summarize Excel, CSV, and PDF reports using Hugging Face language models. Built with Streamlit.

python nlp question-answering data-analysis summarization huggingface streamlit pdf-analysis llm

Updated Oct 23, 2025
Python

michael-eble / pdf-analysis-word-extraction-word-frequencies

Star

PDF Analysis: Extracting words and their word frequencies from PDF files; Preparation of text data for performing topic analysis on annual reports of German car manufacturers - e.g. Volkswagen, Porsche and Audi. Please note that words are only being extracted, stemming is not being applied. In order to improve this, use nltk.stem.snowball.Snowba…

natural-language-processing german nlp-parsing pdf-analysis

Updated Sep 11, 2019
Python

jlmayorgaco / r-biblio-synth

Star

This project focuses on automating the analysis and reporting of bibliometric data, specifically targeting the annual production of academic articles. The primary goal is to understand trends, anomalies, and patterns in bibliometric data through a combination of statistical modeling and exploratory data analysis.

data-science data-visualization data-analysis scopus r-language systematic-reviews anomaly-detection bibliometrics literature-review data-analysis-in-r report-generation regression-modeling pdf-analysis research-tools bibliometrix-package llm-integration trending-analysis reserach-

Updated Nov 18, 2025
PostScript

lhiebert01 / GenAI_PDF_App

Star

Advanced PDF analysis and question-answering application powered by Google's Gemini Pro AI. Upload PDFs and get intelligent, structured responses to your questions about the document content

python streamlit pdf-analysis langchain genai-chatbot gemini-pro

Updated Dec 2, 2024
Python

MaliosDark / Pdf-infected-Virus-Scanner-Online

Star

A secure, AI-enhanced file scanning tool built on Flask, strengthened with ClamAV and PDF analysis, designed to vigilantly detect digital threats and potential vulnerabilities.

flask ai sqlite web-application clamav cybersecurity malware-detection ai-security threat-detection pdf-analysis digital-security file-scanning

Updated Sep 9, 2024
HTML

Rakshath66 / Chat-With-Your-PDF

Star

Streamlit-based chatbot to interact with PDFs using Retrieval-Augmented Generation (RAG), FAISS, Sentence Transformers, and Mistral LLM

Updated Jul 3, 2025
Python

divyeshmutha12 / AI-Research-Assistant

Star

An AI-powered Multi-Agent Research Assistant built with Generative AI, Agentic AI, LangGraph, RAG, Persistent Memory, FAISS, and FastAPI. It supports multi-format uploads, intelligent PDF analysis, and expert-like Q&A via Researcher, Summarizer, Critic, and Editor agents—offering deep, contextual, and interactive research insights.

openai multiagent-systems faiss rag research-assistant fastapi vector-database pdf-analysis ai-chatbots generative-ai langgraph agentic-ai

Updated Oct 9, 2025
Python

PKHarsimran / IOC-Inspector

Star

Fast, SOC‑ready malicious document scanner that turns suspicious PDFs, DOC(X), XLS(X), and RTFs into IOC‑rich, SIEM‑friendly reports.

python cli ioc static-analysis cybersecurity malware-analysis threat-intelligence abuseipdb virus-total soc-tools pdf-analysis office-macros

Updated Jul 23, 2025
Python

RaghuSharma14 / PDF-Reader

Star

A PDF Reader application powered by AI, allowing users to upload PDF documents and extract meaningful information using advanced NLP models. Built with Streamlit, Transformers, and Langchain, this app provides a seamless interface for interacting with and analyzing PDF content.

machine-learning automation transformers text-extraction pdf-reader pdf-extraction streamlit pdf-analysis langchain natural-language-processing-nlp

Updated Apr 24, 2025
Python

MahirSalahin / chat-pdf

Star

A RAG project. Chat PDF

chat-application faiss streamlit pdf-analysis langchain chat-pdf gemini-chat

Updated Aug 30, 2024
Python

bylickilabs / pdfAnalyzer

Sponsor

Star

PDF Analyzer** ist ein effizientes Python-Tool zur automatischen Analyse von PDF-Dokumenten.

python cli open-source metadata pdf text-mining automation reporting document-analysis document-processing file-analyzer pdf-extraction streamlit pdf-analysis file-inspector

Updated Jun 30, 2025
Python

FrancescoRomeo02 / multimodalragApp

Star

Advanced multimodal RAG system for querying PDF documents with text, images, and tables using vector embeddings, semantic chunking, and LLMs via Groq API

nlp machine-learning ai computer-vision chatbot semantic-search multimodal rag groq streamlit pdf-analysis document-intelligence qdrant langchain

Updated Jul 29, 2025
Python

Ouns-AN / pdf-page-counter

Star

Offline web app to count pages in PDF files using PDF.js

pdf counter csv offline simple drag-and-drop pyqt5 vanilla-js client-side pdfjs pdf-tools page-counter pdf-analysis simple-tools

Updated Dec 14, 2025
JavaScript

mkapulica / PDF-Page-Counter

Star

An extremely fast and user-friendly PDF page counter app for multiple PDF files.

python pdf-tools pdf-analysis pdf-page-count

Updated Jun 10, 2024
Python

frankwiersma / pdf-chat-gemini

Star

Intelligent PDF document analysis using Google Gemini AI with File Search capabilities

python file-search document-search streamlit pdf-analysis ai-assistant gemini-ai

Updated Nov 22, 2025
Python

marcusmcb / ai-pdf-tutor

Star

Demo AI app that summarizes PDF documents via text & voice

text-to-speech ai data-parsing pdf-analysis

Updated Jul 1, 2025
TypeScript

rishisolanke / PDF_Query_Langchain

Star

PDF Query LangChain is a tool that extracts and queries information from PDF documents using advanced language processing. Leveraging LangChain, OpenAI, and Cassandra, this app enables efficient, interactive querying of PDF content. Ideal for data analysis, research, and automated reporting, it simplifies detailed document analysis with ease.

python nlp natural-language-processing artificial-intelligence openai data-analysis research-tool pdf-extraction pdf-analysis langchain document-query

Updated Jul 23, 2024
Python

rohanag03 / PDF-Insights

Star

This project uses Google's Generative AI to analyze and answer questions about PDF content. It provides a user-friendly interface to upload PDFs and receive insightful answers generated by the Gemini AI model.

python gemini-api google-ai pdf-analysis generative-ai

Updated Jun 16, 2024
Python

colingalbraith / OpenRAGSearch

Star

Local RAG-powered document analysis platform with PDF QA, Ollama integration, and citation-aware search.

semantic-search offline-app rag fastapi vector-search pdf-analysis document-intelligence langchain chromadb local-llm retrieval-augmented-generation ollama

Updated Jul 26, 2025
JavaScript

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pdf-analysis

Here are 30 public repositories matching this topic...

tfmorris / pdf2table

SreejaBethu / Smart-Report-Analyzer

michael-eble / pdf-analysis-word-extraction-word-frequencies

jlmayorgaco / r-biblio-synth

lhiebert01 / GenAI_PDF_App

MaliosDark / Pdf-infected-Virus-Scanner-Online

Rakshath66 / Chat-With-Your-PDF

divyeshmutha12 / AI-Research-Assistant

PKHarsimran / IOC-Inspector

RaghuSharma14 / PDF-Reader

MahirSalahin / chat-pdf

bylickilabs / pdfAnalyzer

FrancescoRomeo02 / multimodalragApp

Ouns-AN / pdf-page-counter

mkapulica / PDF-Page-Counter

frankwiersma / pdf-chat-gemini

marcusmcb / ai-pdf-tutor

rishisolanke / PDF_Query_Langchain

rohanag03 / PDF-Insights

colingalbraith / OpenRAGSearch

Improve this page

Add this topic to your repo