T his playbook is your guide to building a fully AI-powered analytics system that transforms raw business data into fast, actionable insights. By leveraging a modular βAI Data Science Teamβ of SQL and visualization agents, this repo makes it easy to:
Each chapter guides you through designing and wiring up modular agentsβso you can ask complex business questions in plain English and get back SQL, clean data, or beautiful visualizations instantly. Every step of the pipeline for your own business needs.
Kick off with automated machine learning: train and deploy an H2O AutoML model to score your leads, helping sales and marketing prioritize with real predictive power.
Use LangChain + OpenAI to ask questions in plain English and get optimized SQL queries with real answersβno manual SQL needed.
Build modular workflows as Directed Acyclic Graphs (DAGs), connecting agents together so your system is transparent, debuggable, and scalable.
Teach your agents to transform SQL results into Pandas DataFrames, and automatically decide when to display a chart or a table.
Let AI write and execute Python code that turns data into beautiful, interactive Plotly chartsβon the fly, based on the userβs intent.
Wrap up all agents and workflows into reusable Python modulesβplug and play for any future data pipeline or product.
Deliver a user-friendly, live dashboard with Streamlitβpowered by your agentic backend, with authentication, history, and live analytics.
See how specialized agents (SQL, EDA, Charting) work as a teamβready to be extended with more business logic, analytics, or LLM-powered features.
"This is super cool! I remember struggling to bridge that gap between insights and code. A tool that shows the 'how' is a total game-changer for learning." - Tran Tien Van
sql_agent_orchestration/
β
βββ 01_machine_learning_h2o.py
βββ 02_sql_agent.py
βββ 03_sql_agent_langgraph.py
βββ 04_add_pandas_langgraph.py
βββ 05_add_routing_langgraph.py
βββ 06_add_plotting_langgraph.py
βββ 07_modular_bi_agent.py
βββ 08_streamlit_app_persistant_plot_and_df.py
βββ 09_streamlit_app_bi_copilot.py
βββ 10_ai_data_science_team.py
β
βββ additional-requirements.txt
βββ credentials.yml
βββ README.md
β
βββ business_intelligence_agent/ # BI agent Python package
βββ database/
β βββ leads_scored.db # SQLite database
β βββ leads_scored.db-shm
β βββ leads_scored.db-wal
βββ models/ # ML models
βββ notebooks/ # Jupyter notebooks
βββ project2_architecture.drawio # Project architecture diagram
git clone https://github.com/LannonTheCannon/Cloudberry_AWS_Bootcamp.git
cd Cloudberry_AWS_Bootcamp/Portfolio_V2
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
python3 app.py