Skip to main content

Project Gantt Chart

Overview

This document contains the comprehensive Gantt chart for the Ohpen Data Lake Project (Q2 2026). The chart visualizes all project phases, tasks, dependencies, and key milestones.

Project Start Date: March 1, 2026
Go-Live Date: May 23, 2026
Project Duration: 3 months

Interactive Gantt Chart

Phase Breakdown

Phase 1: MVP (Weeks 1-4)

Objective: Establish foundation and core ETL pipeline

TaskDurationStartEnd
Architecture Design7 daysMar 1Mar 7
Infrastructure Setup7 daysMar 8Mar 14
Requirements Review (Milestone)-Mar 14Mar 14
ETL Development14 daysMar 15Mar 28
Infrastructure Provisioning7 daysMar 22Mar 28
CI/CD Setup7 daysMar 29Apr 4
Bronze Layer Implementation28 daysMar 1Mar 28
Silver Layer Implementation21 daysMar 8Mar 28
CloudWatch Monitoring14 daysMar 15Mar 28
ETL Complete (Milestone)-Mar 28Mar 28
Schema Approval (Milestone)-Mar 28Mar 28

Deliverables:

  • ✅ Bronze layer (raw data ingestion)
  • ✅ Silver layer (validated Parquet)
  • ✅ Basic ETL pipeline
  • ✅ CloudWatch monitoring

Success Criteria:

  • ✅ Process 1 month of historical data
  • ✅ Quarantine invalid records
  • ✅ Basic reporting queries work

Phase 2: Production (Weeks 5-8)

Objective: Gold layer, SQL, and production readiness

TaskDurationStartEnd
SQL Development7 daysMar 29Apr 4
Athena Tables Setup7 daysApr 5Apr 11
SQL Testing7 daysApr 12Apr 18
Performance Validation7 daysApr 19Apr 25
Gold Layer Design (Task 2)7 daysApr 12Apr 18
SQL Aggregation Pattern (Task 3)7 daysApr 19Apr 25
Integration Testing7 daysApr 19Apr 25
Schema Versioning14 daysApr 5Apr 18
Governance Workflows14 daysApr 12Apr 25
Production Deployment7 daysApr 26May 2
UAT (Milestone)-Apr 25Apr 25

Deliverables:

  • ✅ Gold layer structure design (Task 2) + SQL aggregation pattern (Task 3)
  • ✅ Schema versioning
  • ✅ Governance workflows
  • ✅ Production deployment

Success Criteria:

  • ✅ Automated monthly reporting
  • ✅ Full audit trail
  • ✅ Schema evolution process

Phase 3: Rollout (Weeks 9-12)

Objective: Testing, training, and go-live

TaskDurationStartEnd
Production Rollout14 daysApr 26May 9
Monitoring Setup14 daysApr 26May 9
Training (Milestone)-May 9May 9
Final Testing (Milestone)-May 16May 16
Documentation7 daysMay 13May 19
Go-Live (Milestone)-May 23May 23

Key Activities:

  • User acceptance testing
  • Team training sessions
  • Final validation and testing
  • Production deployment
  • Go-live support

Phase 4: Optimization (Month 4+)

Objective: Post-launch improvements

TaskDurationStartEnd
Performance Tuning14 daysJun 1Jun 14
Cost Optimization14 daysJun 15Jun 28
Advanced Analytics14 daysJun 15Jun 28

Deliverables:

  • Performance tuning
  • Cost optimization
  • Advanced analytics

Key Milestones

MilestoneDateWeekStakeholder Involvement
M1: Requirements ReviewMar 14, 2026Week 2All: Review & approve requirements
M2: ETL CompleteMar 28, 2026Week 4Technical team validation
M3: Schema ApprovalMar 28, 2026Week 4Finance: Approve Gold schema
Legal: Approve security
M4: UATApr 25, 2026Week 8All: Test platform, validate reports
M5: TrainingMay 9, 2026Week 10Operations, Analysts: Attend training
M6: Final TestingMay 16, 2026Week 11All: Validate reports, practice processes
M7: Go-LiveMay 23, 2026Week 12All: Support first production run

Task Dependencies

Critical Path

  1. Architecture Design → Infrastructure Setup → Infrastructure Provisioning
  2. ETL Development → ETL Complete → Integration Testing
  3. SQL Development → SQL Testing → Performance Validation
  4. Gold Layer Design → SQL Aggregation Pattern → Integration Testing
  5. Integration Testing → UAT → Production Deployment → Go-Live

Parallel Workstreams

  • Infrastructure & ETL: Can run in parallel after architecture design
  • SQL & Gold Layer: Can run in parallel during Phase 2
  • Schema Versioning & Governance: Can run in parallel during Phase 2
  • Production Rollout & Monitoring: Run in parallel during Phase 3

Resource Allocation

Phase 1 (MVP)

  • Data Engineers: Architecture, ETL development, infrastructure
  • DevOps Engineers: CI/CD setup, infrastructure provisioning
  • Data Architects: Schema design, data lake architecture

Phase 2 (Production)

  • Data Engineers: SQL development, Gold layer design
  • QA Engineers: Testing, performance validation
  • Data Governance: Schema versioning, governance workflows

Phase 3 (Rollout)

  • All Teams: UAT, training, final testing
  • Operations: Production rollout, monitoring
  • Documentation: Technical documentation

Phase 4 (Optimization)

  • Data Engineers: Performance tuning, cost optimization
  • Analytics Team: Advanced analytics

Risk Mitigation Timeline

RiskMitigationTimeline
Schema changesSchema versioning processWeek 4-8
Performance issuesPerformance validationWeek 6
Integration failuresIntegration testingWeek 8
User adoptionTraining sessionsWeek 10
Production issuesMonitoring setupWeek 9-12

Viewing the Gantt Chart

This Gantt chart uses Mermaid syntax and can be viewed in:

  • GitHub (native support)
  • GitLab (native support)
  • VS Code with Mermaid extension
  • Online Mermaid editor: https://mermaid.live
  • Documentation sites (Docusaurus, MkDocs, etc.)

Updates

Last Updated: January 2026
Project Start Date: March 1, 2026
Owner: Data Platform Team


© 2026 Stephen AdeiCC BY 4.0