Chat with Multiple PDFs using RAG

Introduction

The project showcases a system where users can upload their pdf and can ask questions related to the pdf which will be answered by the chatbot. The PDF is first read and then processed. It is first chunked down into multiple chunk of texts, which are then converted to vector embeddings and stored in a vector database. The user then asks a question based on the pdf contents. The question is converted into a question embedding and then semantic search is used to generate the ranked results from the vector store, which is then passed to the LLM to generate the answer of the question asked by the user based on the PDFs.

Getting Started

Clone this repository:

git clone https://github.com/sumitaryal/chat-with-multiple-pdfs.git

Create a virtual environment:

python -m venv /path/to/new/virtual/environment

Install all the required packages:
```
pip install -r requirements.txt
```
Setup the environment variables in the .env file as HUGGINGFACEHUB_API_TOKEN The token should be write-based token.
Run the project:
```
python app.py
```

Here is the snippet of the UI

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
assets		assets
pdf files		pdf files
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Chat with Multiple PDFs using RAG

Introduction

Getting Started

About

Uh oh!

Releases

Packages

Uh oh!

Languages

sumitaryal/chat-with-multiple-pdfs

Folders and files

Latest commit

History

Repository files navigation

Chat with Multiple PDFs using RAG

Introduction

Getting Started

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages