Data Engineering🔨 Personal Project
Real-Time Clickstream Analytics Platform
Production-ready clickstream analytics pipeline with Flask API, PostgreSQL storage, Grafana dashboards, and Python visualization for tracking user behavior in real-time with Docker containerization.
Project Overview
🎯
Objective
Build lightweight yet production-ready clickstream analytics system for tracking website user behavior without heavy infrastructure like Kafka/Spark
💼
My Role
Data Engineer - designed database schema, API endpoints, visualization layer, and Docker deployment
⏱️
Timeline
1 week
🛠️
Tech Stack
Python, Flask, PostgreSQL
📈
Key Results
- ✓Simple but complete pipeline: Website JS tracking → Flask API → PostgreSQL → Grafana/Python viz
- ✓RESTful API with /track endpoint, session tracking, timestamp capture, user agent logging
- ✓Multiple visualization options: Python terminal dashboard, Grafana interactive dashboards, API JSON stats
- ✓Production-ready: Docker Compose orchestration, proper schema with indexes, comprehensive documentation
Impact
Real-time event tracking
Value
Complete Docker deployment
📸
Visual Documentation
Screenshot 1
Screenshot 2
Screenshot 3
Tools & Technologies
PythonFlaskPostgreSQLGrafanaDockerNginxJavaScript