Weaver is a modular pipeline that dynamically combines SQL and Large Language Models (LLMs) for advanced table-based question answering. Unlike rigid approaches, Weaver generates flexible execution plans that use SQL for structured data operations and LLMs for semantic reasoning, automatically deciding the best tool for each subtask. Our method consistently outperforms state-of-the-art approaches across four major TableQA datasets while reducing API costs and improving accuracy through intelligent query decomposition.
**Paper:** [Weaver: Interweaving SQL and LLM for Table Reasoning](https://arxiv.org/abs/2505.18961)
## Installation

- Clone the repository:

  ```shell
  git clone https://github.com/rohitkhoja/weaver.git
  cd weaver
  ```

- Install dependencies:

  ```shell
  pip install -r requirements.txt
  ```

- Install Weaver in editable mode:

  ```shell
  pip install -e .
  ```
## Configuration

- Copy the environment template:

  ```shell
  cp .env.example .env
  ```

- Configure your `.env` file with the following essential settings:
```shell
# REQUIRED: LLM API key (choose one provider)
OPENAI_API_KEY=your-openai-api-key-here

# REQUIRED: LLM model (LiteLLM format: provider/model)
LLM_MODEL=openai/gpt-4o-mini

# REQUIRED: dataset directory (where your CSV files are stored)
WEAVER_DATASETS_DIR=./datasets

# REQUIRED: database configuration (MySQL recommended)
WEAVER_DB_TYPE=mysql
WEAVER_DB_HOST=localhost
WEAVER_DB_PORT=3306
WEAVER_DB_NAME=weaver_db
WEAVER_DB_USER=root
WEAVER_DB_PASSWORD=your-mysql-password

# Optional: logging level
WEAVER_LOG_LEVEL=INFO
```
> ⚠️ **Important:** For now, use MySQL as the database backend. Support for other databases is in progress.
Make sure you have MySQL installed and running, then create the database:

```shell
# Create database
mysql -u root -p
CREATE DATABASE weaver_db;
exit
```
## Quick Start

### Single Question

```python
from weaver import TableQA, WeaverConfig

# Initialize with environment configuration
config = WeaverConfig.from_env()
qa = TableQA(config)

# Ask a question using the JSON object format
question_obj = {
    "table_id": "example-001",
    "question": "Which country had the most cyclists finish within the top 10?",
    "table_file_name": "./datasets/WikiTableQuestions/csv/203-csv/733.csv",
    "target_value": "Italy",
    "table_name": "2008 Clásica de San Sebastián"
}

result = qa.ask(question_obj)
print(f"Answer: {result.answer}")
print(f"Correct: {result.is_correct}")
```
### Batch Evaluation

```python
from weaver import TableQA, WeaverConfig

config = WeaverConfig.from_env()
qa = TableQA(config)

# Process multiple questions from a dataset
results = qa.evaluate_dataset(
    dataset_name="wikitq",
    data_path="./datasets/wikitq.json",
    num_samples=100
)

# Calculate accuracy
accuracy = sum(r.is_correct for r in results) / len(results)
print(f"Accuracy: {accuracy:.2%}")
```
### Questions with Textual Context (e.g., FinQA)

```python
question_obj = {
    "table_id": "ADI/2011/page_61.pdf",
    "question": "What is the percentage change in cash flow hedges in 2011 compared to 2010?",
    "table_file_name": "./datasets/FINQA/csv/ADI_2011_page_61.csv",
    "target_value": "9.9%",
    "table_name": "ADI/2011/page_61.pdf",
    "paragraphs": "Additional context about cash flow hedges and financial data..."
}

result = qa.ask(question_obj)
print(f"Answer: {result.answer}")
```
## Command-Line Interface

```shell
# Ask a single question
python -m weaver.cli.main ask "Which country won the most medals?" \
    --table-path ./datasets/olympics.csv

# Evaluate on a dataset
python -m weaver.cli.main evaluate wikitq \
    --data-path ./datasets/wikitq.json \
    --num-samples 50

# Show configuration
python -m weaver.cli.main config-info
```
## Datasets

Weaver supports multiple TableQA datasets. Place your data in the directory specified by `WEAVER_DATASETS_DIR`, using this structure:
```
datasets/
├── WikiTableQuestions/
│   └── csv/
│       └── 203-csv/
│           └── 733.csv
├── FINQA/
│   └── csv/
│       └── ADI_2011_page_61.csv
├── TabFact/
│   └── csv/
├── OTT-QA/
│   └── tables/
├── wikitq.json    # Question dataset
├── finqa.json     # Question dataset
├── tabfact.json   # Question dataset
└── ott-qa.json    # Question dataset
```
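A quick sanity check of this layout can save a failed run later. The `check_layout` helper below is a hypothetical utility, not part of Weaver's API; the `EXPECTED` list simply mirrors the tree above:

```python
from pathlib import Path

# Expected entries under WEAVER_DATASETS_DIR (illustrative, not exhaustive)
EXPECTED = [
    "WikiTableQuestions/csv",
    "FINQA/csv",
    "TabFact/csv",
    "OTT-QA/tables",
    "wikitq.json",
    "finqa.json",
    "tabfact.json",
    "ott-qa.json",
]

def check_layout(root: str) -> list[str]:
    """Return the expected paths that are missing under `root`."""
    base = Path(root)
    return [p for p in EXPECTED if not (base / p).exists()]

missing = check_layout("./datasets")
if missing:
    print("Missing:", ", ".join(missing))
```

Only the datasets you actually evaluate need to be present; treat anything reported as missing accordingly.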
Each question dataset is a JSON array of question objects:

```json
[
  {
    "table_id": "nu-0",
    "question": "Which country had the most cyclists finish within the top 10?",
    "table_file_name": "./datasets/WikiTableQuestions/csv/203-csv/733.csv",
    "target_value": "Italy",
    "table_name": "2008 Clásica de San Sebastián",
    "paragraphs": "Optional context text..."
  }
]
```
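Before a batch run, it can help to verify that every entry carries the required keys. The `REQUIRED_KEYS` set and `validate_questions` helper below are illustrative assumptions based on the format shown, not part of Weaver's API (`paragraphs` is treated as optional):

```python
import json

# Keys every question object must carry; "paragraphs" is optional context
REQUIRED_KEYS = {"table_id", "question", "table_file_name", "target_value", "table_name"}

def validate_questions(entries: list[dict]) -> list[tuple[int, set]]:
    """Return (index, missing_keys) for each malformed entry."""
    problems = []
    for i, entry in enumerate(entries):
        missing = REQUIRED_KEYS - entry.keys()
        if missing:
            problems.append((i, missing))
    return problems

data = json.loads("""[
  {"table_id": "nu-0",
   "question": "Which country had the most cyclists finish within the top 10?",
   "table_file_name": "./datasets/WikiTableQuestions/csv/203-csv/733.csv",
   "target_value": "Italy",
   "table_name": "2008 Clasica de San Sebastian"}
]""")
print(validate_questions(data))  # an empty list means every entry is well-formed
```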
## Environment Variables

| Variable | Description | Example | Required |
|---|---|---|---|
| `OPENAI_API_KEY` | OpenAI API key | `sk-proj-...` | ✅ |
| `LLM_MODEL` | LLM model in LiteLLM format | `openai/gpt-4o-mini` | ✅ |
| `WEAVER_DATASETS_DIR` | Path to datasets directory | `./datasets` | ✅ |
| `WEAVER_DB_TYPE` | Database type | `mysql` | ✅ |
| `WEAVER_DB_HOST` | Database host | `localhost` | ✅ |
| `WEAVER_DB_PORT` | Database port | `3306` | ✅ |
| `WEAVER_DB_NAME` | Database name | `weaver_db` | ✅ |
| `WEAVER_DB_USER` | Database username | `root` | ✅ |
| `WEAVER_DB_PASSWORD` | Database password | `your_password` | ✅ |
| `WEAVER_LOG_LEVEL` | Logging level | `INFO` | ⚪ |
| `LLM_TEMPERATURE` | Model temperature | `0.01` | ⚪ |
| `LLM_MAX_TOKENS` | Max output tokens | `2048` | ⚪ |
## LLM Providers

Weaver uses LiteLLM and supports 100+ LLM providers:

```shell
# OpenAI
export OPENAI_API_KEY="sk-..."
export LLM_MODEL="openai/gpt-4o-mini"

# Anthropic Claude
export ANTHROPIC_API_KEY="sk-ant-..."
export LLM_MODEL="anthropic/claude-3-sonnet-20240229"
```
## Evaluation

Weaver has been evaluated on four major TableQA datasets:

- **WikiTableQuestions**: Complex reasoning over Wikipedia tables
- **TabFact**: Fact verification over tables
- **FinQA**: Financial reasoning with numerical tables
- **OTT-QA**: Open table-and-text QA
Our experiments show that Weaver consistently outperforms state-of-the-art methods while reducing API calls and error rates.
For detailed results and analysis, see our paper.
## Architecture

Weaver's modular pipeline consists of:

- **Table Preprocessor**: Handles table loading and column filtering
- **Context Manager**: Manages paragraphs and external context
- **Plan Generator**: Creates step-by-step execution plans
- **SQL-LLM Executor**: Dynamically executes SQL and LLM operations
- **Answer Extractor**: Formats and validates final answers
The system dynamically decides when to use SQL for structured operations and when to leverage LLMs for semantic reasoning.
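The dispatch idea can be sketched with a toy plan whose steps are tagged `sql` or `llm` and routed to the matching executor. All names here (`Step`, `run_plan`, the stub executors) are illustrative, not Weaver's actual classes:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Step:
    tool: str         # "sql" for structured operations, "llm" for semantic reasoning
    instruction: str  # SQL query or natural-language directive

def run_plan(steps: list[Step],
             executors: dict[str, Callable[[str, str], str]],
             state: str = "") -> str:
    """Thread an intermediate result through each step's executor."""
    for step in steps:
        state = executors[step.tool](step.instruction, state)
    return state

# Stub executors standing in for a real SQL engine and an LLM call
executors = {
    "sql": lambda instr, state: f"rows({instr})",
    "llm": lambda instr, state: f"reasoned({instr} over {state})",
}

plan = [
    Step("sql", "SELECT country, rank FROM riders WHERE rank <= 10"),
    Step("llm", "group riders by country and pick the most frequent"),
]
print(run_plan(plan, executors))
```

The real Plan Generator produces such step sequences from the question, and the SQL-LLM Executor plays the role of the dispatch loop.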
## Contributing

We welcome contributions! Please see [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.
## Citation

If you use Weaver in your research, please cite our paper:

```bibtex
@misc{khoja2025weaverinterweavingsqlllm,
  title={Weaver: Interweaving SQL and LLM for Table Reasoning},
  author={Rohit Khoja and Devanshu Gupta and Yanjie Fu and Dan Roth and Vivek Gupta},
  year={2025},
  eprint={2505.18961},
  archivePrefix={arXiv},
  primaryClass={cs.AI},
  url={https://arxiv.org/abs/2505.18961},
}
```
## Acknowledgments

This work was inspired by and builds upon several important contributions in the field:

- **BlendSQL**: A Scalable Dialect for Unifying Hybrid Question Answering in Relational Algebra
- **ProTrix**: Building Models for Planning and Reasoning over Tables with Sentence Context
- **H-STAR**: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables
- **Binder**: Binding Language Models in Symbolic Languages
## License

This project is licensed under the MIT License - see the LICENSE file for details.