Real Industry Projects

Projects That Prove Your Skills

Every project mirrors production-grade work at UK data and tech companies, not classroom exercises.

Production RAG Chatbot

Retrieval-Augmented Generation · NLP · Vector Search

Python · LangChain · OpenAI API · Pinecone / FAISS · FastAPI · Docker

What You Build

  • End-to-end RAG pipeline ingesting PDFs and web content
  • Vector store indexing with chunking and embedding strategies
  • FastAPI backend with streaming response support
  • Dockerised deployment with CI/CD pipeline
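
To give a flavour of the chunking step, here is a minimal sketch of fixed-size chunking with overlap, one common strategy among several. The function name `chunk_text` and the character-based sizes are illustrative assumptions, not the programme's actual code; a real pipeline would more likely split on tokens or sentences and pass each chunk to an embedding model.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character windows that overlap.

    Hypothetical helper for illustration: sizes are in characters, not tokens.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window has reached the end of the text,
        # so no redundant tail chunk is emitted.
        if start + chunk_size >= len(text):
            break
    return chunks
```

The overlap means a sentence cut at a chunk boundary still appears whole in at least one neighbouring chunk, which tends to help retrieval quality at the cost of a larger index.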

Deliverables

  • GitHub repo with full README
  • Architecture diagram (data flow + component map)
  • Live demo endpoint

Skills Employers Assess

  • LLM application design
  • Prompt engineering
  • Vector databases
  • API development
  • Containerisation

Azure Data Engineering Pipeline

Data Engineering · Cloud · ETL/ELT · Orchestration

Azure Data Factory · Azure Databricks · PySpark · Delta Lake · Azure Blob Storage · dbt · Airflow

What You Build

  • Medallion architecture (Bronze → Silver → Gold) on Azure
  • Incremental data ingestion from REST APIs and flat files
  • PySpark transformations with data quality checks
  • dbt models for Gold layer with documentation
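
Incremental ingestion is often built around a high-water mark: each run loads only records changed since the last successful run. The sketch below shows that idea in plain Python, assuming each source record carries an `updated_at` timestamp. The function name `select_new_records` and the record shape are hypothetical; in the project itself this logic would typically live in PySpark or a Data Factory pipeline rather than in-memory lists.

```python
from datetime import datetime


def select_new_records(records: list[dict], watermark: datetime) -> tuple[list[dict], datetime]:
    """Return the records updated after the watermark, plus the new watermark.

    Assumes every record has an 'updated_at' datetime (an illustrative schema).
    """
    fresh = [r for r in records if r["updated_at"] > watermark]
    # If nothing new arrived, keep the old watermark so the next run
    # starts from the same point.
    new_watermark = max((r["updated_at"] for r in fresh), default=watermark)
    return fresh, new_watermark
```

Persisting the returned watermark only after the load succeeds keeps a failed run from skipping records on the next attempt.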

Deliverables

  • GitHub repo with IaC scripts
  • Architecture diagram (Azure components)
  • dbt docs site screenshot

Skills Employers Assess

  • Cloud data architecture
  • Spark processing
  • Pipeline orchestration
  • Data modelling
  • Azure ecosystem

AWS ML Model Deployment

MLOps · Model Serving · Cloud Infrastructure

Python · Scikit-learn / XGBoost · MLflow · AWS SageMaker · S3 · Lambda · Terraform

What You Build

  • End-to-end ML training pipeline with experiment tracking via MLflow
  • Model packaging and registration in SageMaker Model Registry
  • Real-time inference endpoint with API Gateway + Lambda
  • Infrastructure-as-Code with Terraform

Deliverables

  • GitHub repo with model + IaC
  • Architecture diagram (AWS services)
  • Load-tested endpoint screenshot

Skills Employers Assess

  • MLOps
  • Model deployment
  • AWS services
  • Infrastructure as Code
  • API design

BI Analytics Dashboard

Data Analytics · Visualisation · SQL · Reporting

SQL · dbt · Power BI / Tableau · Python (pandas) · PostgreSQL / BigQuery

What You Build

  • Star schema data model optimised for BI consumption
  • dbt transformations with tests and documentation
  • Executive-level dashboard with KPIs, trends, and drill-downs
  • Automated data refresh pipeline
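
As a rough illustration of what a star schema looks like, the sketch below builds one fact table and two dimension tables in an in-memory SQLite database and runs a typical dashboard query. SQLite stands in for PostgreSQL / BigQuery here, and the table and column names (`fact_sales`, `dim_product`, `dim_date`) are assumptions chosen for the example, not the project's actual model.

```python
import sqlite3


def build_star_schema(conn: sqlite3.Connection) -> None:
    """Create a tiny star schema: one fact table keyed to two dimensions."""
    conn.executescript("""
        CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, category TEXT NOT NULL);
        CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, month TEXT NOT NULL);
        CREATE TABLE fact_sales (
            product_id INTEGER NOT NULL REFERENCES dim_product(product_id),
            date_id INTEGER NOT NULL REFERENCES dim_date(date_id),
            revenue REAL NOT NULL
        );
    """)


def revenue_by_category_month(conn: sqlite3.Connection) -> list[tuple]:
    """Aggregate the fact table by attributes from both dimensions."""
    return conn.execute("""
        SELECT p.category, d.month, SUM(f.revenue)
        FROM fact_sales AS f
        JOIN dim_product AS p ON p.product_id = f.product_id
        JOIN dim_date AS d ON d.date_id = f.date_id
        GROUP BY p.category, d.month
        ORDER BY p.category, d.month
    """).fetchall()
```

Keeping measures in the fact table and descriptive attributes in the dimensions is what lets BI tools slice the same numbers by any dimension with simple joins.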

Deliverables

  • GitHub repo (dbt project + SQL models)
  • Dashboard screenshots
  • Data model diagram

Skills Employers Assess

  • Dimensional modelling
  • SQL optimisation
  • BI tooling
  • Stakeholder reporting
  • dbt

Start Building Your Portfolio

You build these projects inside the programme, with guidance, code reviews, and your name on every commit.

Apply Now