Real Industry Projects
Projects That Prove Your Skills
Every project is designed to mirror production-grade work at UK data and tech companies — not classroom exercises.
Production RAG Chatbot
Retrieval-Augmented Generation · NLP · Vector Search
Python · LangChain · OpenAI API · Pinecone / FAISS · FastAPI · Docker
What You Build
- End-to-end RAG pipeline ingesting PDFs and web content
- Vector store indexing with chunking and embedding strategies
- FastAPI backend with streaming response support
- Dockerised deployment with CI/CD pipeline
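To give a flavour of the chunking step, here is a minimal sketch of fixed-size chunking with overlap, one common strategy for preparing text before embedding. The function name `chunk_text` and its default sizes are illustrative assumptions, not the programme's actual code; real pipelines often split on tokens or sentences instead of characters.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character windows that overlap.

    Hypothetical helper for illustration: overlapping windows keep a
    sentence that straddles a boundary visible in both neighbouring chunks.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")

    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the window already reached the end of the text
    return chunks
```

Each chunk would then be embedded and written to the vector store; a larger overlap tends to improve recall at the cost of more vectors to index.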
Deliverables
- GitHub repo with full README
- Architecture diagram (data flow + component map)
- Live demo endpoint
Skills Employers Assess
- LLM application design
- Prompt engineering
- Vector databases
- API development
- Containerisation
Azure Data Engineering Pipeline
Data Engineering · Cloud · ETL/ELT · Orchestration
Azure Data Factory · Azure Databricks · PySpark · Delta Lake · Azure Blob Storage · dbt · Airflow
What You Build
- Medallion architecture (Bronze → Silver → Gold) on Azure
- Incremental data ingestion from REST APIs and flat files
- PySpark transformations with data quality checks
- dbt models for Gold layer with documentation
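Incremental ingestion is usually built around a watermark: the pipeline remembers the latest timestamp it has loaded and only pulls newer records on the next run. The sketch below shows that idea in plain Python under stated assumptions; the function name `select_new_records` and the `updated_at` field are hypothetical, and a real pipeline would persist the watermark and apply the filter in Data Factory or PySpark.

```python
from datetime import datetime


def select_new_records(records: list[dict], watermark: datetime):
    """Return records newer than the watermark, plus the advanced watermark.

    Hypothetical helper: assumes each record carries an 'updated_at' datetime.
    """
    fresh = [r for r in records if r["updated_at"] > watermark]
    # If nothing is new, the watermark stays where it was.
    new_watermark = max((r["updated_at"] for r in fresh), default=watermark)
    return fresh, new_watermark


source = [
    {"id": 1, "updated_at": datetime(2024, 1, 1)},
    {"id": 2, "updated_at": datetime(2024, 1, 3)},
    {"id": 3, "updated_at": datetime(2024, 1, 5)},
]
fresh, watermark = select_new_records(source, datetime(2024, 1, 2))
```

Running the same call again with the returned watermark yields no records, which is what makes the load safe to repeat.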
Deliverables
- GitHub repo with IaC scripts
- Architecture diagram (Azure components)
- dbt docs site screenshot
Skills Employers Assess
- Cloud data architecture
- Spark processing
- Pipeline orchestration
- Data modelling
- Azure ecosystem
AWS ML Model Deployment
MLOps · Model Serving · Cloud Infrastructure
Python · Scikit-learn / XGBoost · MLflow · AWS SageMaker · S3 · Lambda · Terraform
What You Build
- End-to-end ML training pipeline with experiment tracking via MLflow
- Model packaging and registration in SageMaker Model Registry
- Real-time inference endpoint with API Gateway + Lambda
- Infrastructure-as-Code with Terraform
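As a rough illustration of the inference endpoint, the sketch below shows the shape of a Lambda handler behind an API Gateway proxy integration: the request arrives as a JSON string in `event["body"]`, and the response is a dict with `statusCode` and a string `body`. The `make_handler` factory, the `features` field, and the injected `predict` callable are assumptions made for this example; in the project, `predict` would call the SageMaker endpoint.

```python
import json


def make_handler(predict):
    """Build a Lambda-style handler around any predict(features) callable.

    Hypothetical factory: injecting 'predict' keeps the sketch runnable
    without AWS credentials.
    """
    def handler(event, context):
        try:
            payload = json.loads(event.get("body") or "{}")
            features = payload["features"]
        except (json.JSONDecodeError, KeyError, TypeError):
            # Malformed or incomplete request: report a client error.
            return {
                "statusCode": 400,
                "body": json.dumps({"error": "expected a JSON body with 'features'"}),
            }
        return {
            "statusCode": 200,
            "headers": {"Content-Type": "application/json"},
            "body": json.dumps({"prediction": predict(features)}),
        }

    return handler


# Stand-in model for local testing: sums the feature values.
handler = make_handler(lambda features: sum(features))
response = handler({"body": '{"features": [1, 2, 3]}'}, None)
```

Keeping the model call behind a plain callable also makes the handler easy to unit test before any infrastructure exists.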
Deliverables
- GitHub repo with model + IaC
- Architecture diagram (AWS services)
- Load-tested endpoint screenshot
Skills Employers Assess
- MLOps
- Model deployment
- AWS services
- Infrastructure as Code
- API design
BI Analytics Dashboard
Data Analytics · Visualisation · SQL · Reporting
SQL · dbt · Power BI / Tableau · Python (pandas) · PostgreSQL / BigQuery
What You Build
- Star schema data model optimised for BI consumption
- dbt transformations with tests and documentation
- Executive-level dashboard with KPIs, trends, and drill-downs
- Automated data refresh pipeline
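A star schema pairs a central fact table with descriptive dimension tables, so a KPI becomes a join plus an aggregate. The sketch below illustrates this with Python's built-in `sqlite3` module so it runs anywhere; the table names, columns, and sample rows are invented for the example, and the project itself targets PostgreSQL or BigQuery with dbt.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Dimension: one row per product, holding descriptive attributes.
    CREATE TABLE dim_product (
        product_key INTEGER PRIMARY KEY,
        category    TEXT NOT NULL
    );
    -- Fact: one row per sale, holding measures and dimension keys.
    CREATE TABLE fact_sales (
        sale_id     INTEGER PRIMARY KEY,
        product_key INTEGER NOT NULL REFERENCES dim_product (product_key),
        amount      REAL NOT NULL
    );
    INSERT INTO dim_product VALUES (1, 'Books'), (2, 'Games');
    INSERT INTO fact_sales VALUES (1, 1, 10.0), (2, 1, 15.0), (3, 2, 40.0);
""")

# KPI query: revenue per category, the kind of measure a dashboard tile shows.
revenue_by_category = conn.execute("""
    SELECT d.category, SUM(f.amount) AS revenue
    FROM fact_sales AS f
    JOIN dim_product AS d USING (product_key)
    GROUP BY d.category
    ORDER BY d.category
""").fetchall()
```

Because the dimension is small and keyed, BI tools can slice the same fact table by any attribute added to it without changing the fact rows.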
Deliverables
- GitHub repo (dbt project + SQL models)
- Dashboard screenshots
- Data model diagram
Skills Employers Assess
- Dimensional modelling
- SQL optimisation
- BI tooling
- Stakeholder reporting
- dbt
Start Building Your Portfolio
These projects are built inside the programme — with guidance, code reviews, and your name on every commit.
Apply Now