GitHub - RyanFernandes23/Text-to-SQL-agent

Text-to-SQL Agent Performance Documentation
Accuracy: 58.75%

Project Overview

A LangGraph-based agent converts natural-language questions into SQL using the Llama 3 70B model served by Groq, then executes the generated SQL against the Pagila database. The agent retries up to 3 times on errors, logs successful queries to output_queries.txt, and saves results to results.txt.
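
The retry behaviour described above can be sketched roughly as follows. This is a minimal illustration, not the code in main.ipynb: `run_with_retries`, `generate_sql`, and `execute_sql` are hypothetical names, and the sketch assumes "up to 3 times" means three attempts in total, with the previous error message passed back to the generator.

```python
from typing import Callable, List, Optional, Tuple

MAX_ATTEMPTS = 3  # assumption: three attempts in total


def run_with_retries(
    question: str,
    generate_sql: Callable[[str, Optional[str]], str],
    execute_sql: Callable[[str], List[tuple]],
    max_attempts: int = MAX_ATTEMPTS,
) -> Tuple[Optional[str], Optional[List[tuple]], List[str]]:
    """Generate SQL for `question`, retrying with the last error as feedback.

    Returns (sql, rows, errors). sql and rows are None if every attempt failed.
    """
    errors: List[str] = []
    last_error: Optional[str] = None
    for _ in range(max_attempts):
        # Hypothetical generator: receives the question and the previous error.
        sql = generate_sql(question, last_error)
        try:
            rows = execute_sql(sql)
        except Exception as exc:  # a real agent would likely catch psycopg2.Error
            last_error = str(exc)
            errors.append(last_error)
            continue
        return sql, rows, errors
    return None, None, errors
```

In this sketch the caller decides what to do with the returned values, for example appending `sql` to output_queries.txt and writing `rows` to results.txt only when `sql` is not None.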

Setup Instructions

Prerequisites

Docker (for PostgreSQL container)

Python 3.9+ with libraries:

pip install langgraph psycopg2 python-dotenv groq

Groq API Key (set as environment variable GROQ_API_KEY)

Steps

Clone Pagila Database

git clone https://github.com/devrimgunduz/pagila.git
cd pagila

Start PostgreSQL with Docker

docker-compose up -d  # Launches Pagila database on port 5432

Initialize Database

docker exec -it pagila psql -U postgres -c "CREATE DATABASE pagila;"
docker exec -i pagila psql -U postgres pagila < pagila-schema.sql
docker exec -i pagila psql -U postgres pagila < pagila-data.sql

Configure Database Connection
In main.ipynb, update credentials:

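import psycopg2

# Connection settings for the local Docker container
conn = psycopg2.connect(
    host="localhost",
    port=5432,
    user="postgres",
    password="postgres",  # Default Docker setup
    dbname="pagila",  # "database" is a deprecated alias for "dbname"
)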
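
To avoid editing credentials directly in the notebook, the connection settings could be read from the environment instead. The sketch below is one possible approach, not part of the repository: `pagila_conn_params` and `check_connection` are hypothetical helpers, and reading the standard libpq variable names (PGHOST, PGPORT, PGUSER, PGPASSWORD, PGDATABASE) is an assumption, with the defaults taken from the snippet above.

```python
import os
from typing import Dict, Mapping, Union


def pagila_conn_params(env: Mapping[str, str] = os.environ) -> Dict[str, Union[str, int]]:
    """Build psycopg2.connect() keyword arguments, falling back to the Docker defaults."""
    return {
        "host": env.get("PGHOST", "localhost"),
        "port": int(env.get("PGPORT", "5432")),
        "user": env.get("PGUSER", "postgres"),
        "password": env.get("PGPASSWORD", "postgres"),
        "dbname": env.get("PGDATABASE", "pagila"),
    }


def check_connection() -> int:
    """Smoke test: return the row count of Pagila's film table."""
    import psycopg2  # imported lazily so pagila_conn_params works without it

    conn = psycopg2.connect(**pagila_conn_params())
    try:
        with conn.cursor() as cur:
            cur.execute("SELECT count(*) FROM film;")
            return cur.fetchone()[0]
    finally:
        conn.close()
```

Calling `check_connection()` before running the agent gives a quick signal that the container is up and the data has been loaded.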

Run the Agent
Execute main.ipynb to start processing natural language queries.

Test Results & Analysis

(Previous test cases and score breakdown remain unchanged from original documentation)

Architecture

Troubleshooting

Docker Connection Issues

- Verify container is running: docker ps -a
- Check logs: docker logs pagila

Empty Results

- Confirm table names/aliases match Pagila schema (e.g., city vs address.city_id).

Groq API Errors

- Ensure environment variable is set:
```
export GROQ_API_KEY="your-key-here"
```
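
A missing key can also be caught early from Python, before any request is sent. This is a small sketch under the assumption that the notebook reads the key from the GROQ_API_KEY environment variable named in the prerequisites; `require_groq_key` is a hypothetical helper.

```python
import os
from typing import Mapping


def require_groq_key(env: Mapping[str, str] = os.environ) -> str:
    """Return the Groq API key, or raise a clear error if it is missing or blank."""
    key = env.get("GROQ_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "GROQ_API_KEY is not set; export it or add it to a .env file "
            "before running the agent."
        )
    return key
```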

Limitations & Future Work

Schema Awareness
The current agent lacks explicit knowledge of Pagila's table relationships (e.g., the customer → address → city → country join chain).
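
One way such relationships could be made explicit is a small foreign-key map that the agent consults, or that is rendered into the prompt. The sketch below is an illustration only and is not implemented in the repository: `FOREIGN_KEYS` covers just the chain mentioned above, and `join_path` is a hypothetical helper that finds a join sequence with a breadth-first search.

```python
from collections import deque
from typing import Dict, List, Optional, Tuple

# (child_table, parent_table) -> shared key column; only the chain named above.
FOREIGN_KEYS: Dict[Tuple[str, str], str] = {
    ("customer", "address"): "address_id",
    ("address", "city"): "city_id",
    ("city", "country"): "country_id",
}


def join_path(start: str, goal: str) -> Optional[List[str]]:
    """Return the JOIN clauses linking `start` to `goal`, or None if unreachable."""
    # Build an undirected adjacency list so paths can run in either direction.
    neighbours: Dict[str, List[Tuple[str, str]]] = {}
    for (child, parent), column in FOREIGN_KEYS.items():
        neighbours.setdefault(child, []).append((parent, column))
        neighbours.setdefault(parent, []).append((child, column))

    queue = deque([(start, [])])
    seen = {start}
    while queue:
        table, joins = queue.popleft()
        if table == goal:
            return joins
        for other, column in neighbours.get(table, []):
            if other not in seen:
                seen.add(other)
                queue.append((other, joins + [f"JOIN {other} USING ({column})"]))
    return None


if __name__ == "__main__":
    print("FROM customer " + " ".join(join_path("customer", "country") or []))
```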
Error-Driven Retries
Future versions could parse PostgreSQL errors to guide SQL correction (e.g., missing column → suggest joins).
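
As a rough idea of what error-driven correction might look like, the sketch below maps two PostgreSQL SQLSTATE codes to hints that could be fed back to the model. It is not part of the current agent: `correction_hint` is a hypothetical function, and the wording of the hints is an assumption. With psycopg2, the code and message would come from the `pgcode` and `pgerror` attributes of the raised exception.

```python
import re
from typing import Optional

UNDEFINED_COLUMN = "42703"  # SQLSTATE undefined_column
UNDEFINED_TABLE = "42P01"   # SQLSTATE undefined_table

_QUOTED_NAME = re.compile(r'"([^"]+)"')


def correction_hint(pgcode: Optional[str], message: str) -> str:
    """Turn a PostgreSQL error into a short hint for the next generation attempt."""
    match = _QUOTED_NAME.search(message)
    name = match.group(1) if match else "the referenced object"
    if pgcode == UNDEFINED_COLUMN:
        return (
            f"Column {name} was not found in the queried tables; "
            "it may belong to a related table that needs a JOIN."
        )
    if pgcode == UNDEFINED_TABLE:
        return f"Table {name} does not exist; check the name against the Pagila schema."
    # Unknown error class: pass the raw message through unchanged.
    return f"Query failed: {message}"
```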
Performance Scaling
Test the agent against larger tables, such as Pagila's rental table with its 16k+ records.

