T his playbook is your guide to building a fully AI-powered analytics system that transforms raw business data into fast, actionable insights. By leveraging a modular βAI Data Science Teamβ of SQL and visualization agents, this repo makes it easy to:
Each chapter guides you through designing and wiring up modular agentsβso you can ask complex business questions in plain English and get back SQL, clean data, or beautiful visualizations instantly. Every step of the pipeline for your own business needs.
Kick off with automated machine learning: train and deploy an H2O AutoML model to score your leads, helping sales and marketing prioritize with real predictive power.
Use LangChain + OpenAI to ask questions in plain English and get optimized SQL queries with real answersβno manual SQL needed.
Build modular workflows as Directed Acyclic Graphs (DAGs), connecting agents together so your system is transparent, debuggable, and scalable.
Teach your agents to transform SQL results into Pandas DataFrames, and automatically decide when to display a chart or a table.
Let AI write and execute Python code that turns data into beautiful, interactive Plotly chartsβon the fly, based on the userβs intent.
Wrap up all agents and workflows into reusable Python modulesβplug and play for any future data pipeline or product.
Deliver a user-friendly, live dashboard with Streamlitβpowered by your agentic backend, with authentication, history, and live analytics.
See how specialized agents (SQL, EDA, Charting) work as a teamβready to be extended with more business logic, analytics, or LLM-powered features.
"This is super cool! I remember struggling to bridge that gap between insights and code. A tool that shows the 'how' is a total game-changer for learning." - Tran Tien Van
sql_agent_orchestration/ β βββ 01_machine_learning_h2o.py βββ 02_sql_agent.py βββ 03_sql_agent_langgraph.py βββ 04_add_pandas_langgraph.py βββ 05_add_routing_langgraph.py βββ 06_add_plotting_langgraph.py βββ 07_modular_bi_agent.py βββ 08_streamlit_app_persistant_plot_and_df.py βββ 09_streamlit_app_bi_copilot.py βββ 10_ai_data_science_team.py β βββ additional-requirements.txt βββ credentials.yml βββ README.md β βββ business_intelligence_agent/ # BI agent Python package βββ database/ β βββ leads_scored.db # SQLite database β βββ leads_scored.db-shm β βββ leads_scored.db-wal βββ models/ # ML models βββ notebooks/ # Jupyter notebooks βββ project2_architecture.drawio # Project architecture diagram
git clone https://github.com/LannonTheCannon/Cloudberry_AWS_Bootcamp.git cd Cloudberry_AWS_Bootcamp/Portfolio_V2 python3 -m venv venv source venv/bin/activate pip install -r requirements.txt python3 app.py