Divyanshi Kashyap

AI & Civic Tech

CalgaryPulse

AI civic intelligence platform tackling Calgary's 30.4% downtown vacancy crisis. MindFuel Tech Futures 2026 finalist (31 projects, 7 provinces); seed funding approved.

ReactThree.jsFastAPIPostGISCrewAI

Case study Code

ML & Data

InsightEngine

Production-grade ML pipeline for VLT game performance prediction. End-to-end from Snowflake ingestion to deployed explainable prediction system. CatBoost regressors with R²=0.90 profitability, ~130K processed rows, Dockerized on EC2 with weekly CI/CD.

CatBoostSnowflakeDockerSHAPStreamlit

Read full report

AI Agents

TravCan: AI Travel Platform

Production AI travel platform with 3 services (React 18, Rust/Axum, Python CrewAI), 17+ DB tables, 8 API integrations, 11-agent AI orchestration, 10-state flight booking FSM with Stripe payments, Redis caching (6 layers), 334+ automated tests, serving live users at travcan.ca.

ReactRust/AxumCrewAISupabaseRedisStripe

Case study Code

AI Agents

Monitor Lizard

Autonomous co-op job tracking agent on OpenClaw. Nightly Noctis Mode scans 60+ portals with A to F scoring, critic-review pass, and ChromaDB vector memory. 172 tests.

Claude APIChromaDBOpenClawPython

Read more Code

Full Stack

NyxLink

Production-grade URL shortener with AI phishing detection (Google Safe Browsing), real-time bot classification, Redis caching (<2ms p50), and tiered rate limiting.

FastAPIPostgreSQLRedisDocker

Demo Code

AI Agents

Kaashvi

Desktop-native ReAct agent, a personal AI chief of staff. Autonomously plans and executes across Google Calendar and Notion with a multi-step reasoning loop.

ElectronReactGoogle OAuth2Notion API

Read more Code

DevTools

NightShade

Adversarial red-teaming framework targeting OWASP LLM Top 10: LLM01 prompt injection, LLM06 disclosure, LLM07 insecure plugins, LLM08 excessive agency. Applied to TravCan security hardening at production.

PythonOWASP LLMAnthropic SDK

Read more Code

DevTools

CarbonLedger

Cross-platform C++17 library measuring real-time CPU & memory consumption, converting telemetry to CO₂ estimates via the Green Software Foundation SCI formula. 85% coverage CI gate.

C++17CMakeGCC/Clang/MSVCValgrind

Read more Code

Layer	Technologies
Languages	Python 3.11, SQL
Data Processing	pandas, NumPy, SciPy, scikit-learn
ML Frameworks	CatBoost, XGBoost, LightGBM, PyTorch
Explainability	SHAP (TreeSHAP via CatBoost)
Databases	MySQL, Snowflake, ChromaDB
Frontend	Streamlit, Plotly
Infrastructure	Docker, Nginx, AWS EC2, S3, Bedrock
CI/CD	GitHub Actions (13 workflows), self-hosted EC2 runner
Experiment Tracking	MLflow with S3 backend

Metric	Value
Jurisdictions	9 (AGLC, ALC, MBLL, SD, OSL, SEJQ, Sweden, WCLC, Italy)
Countries	4 (Canada, USA, Sweden, Italy)
Currencies	4 (CAD, USD, SEK, EUR) — all normalized to USD
Unique Games	~600
Weekly Performance Rows	~130,000
Trained ML Models	41
Prediction Rows	780 (game × region combinations)

Model Category	Count	Features	Performance
Shape classifier	1	22 char-only	Accuracy 58%, F1 0.53
Profitability (full features)	10	34 (char + perf)	R² 0.77–0.90
Risk (full features)	10	34 (char + perf)	R² 0.13–0.66
Profitability (char-only)	10	18 char-only	R² 0.40–0.65
Risk (char-only)	10	18 char-only	Intentionally lower

Decision	Academic Source
Cold-start content-based fallback	Schein et al. (2002), ACM SIGIR
Similarity imputation over regression	Razavi-Far et al. (2021), PeerJ CS
Gower coefficient for mixed types	Gower (1971), Biometrics
Feature-importance weighting	Wilson & Martinez (1997), JAIR
Mandatory match constraints	Richter & Weber (2013), Springer
Threshold-based neighbor selection	Anagnostopoulos et al. (2024), IJDSA
Adaptive K with dilution control	Desrosiers & Karypis (2011), Springer

Tier	When	Data Source	Confidence
Tier 1 (Exact Match)	Game exists in DB with perf data	Real NTI from silver tables	HIGH
Tier 2 (Similar Match)	Similar game(s) found ≥ 70%	Weighted proxy from KNN neighbors	MEDIUM
Tier 3 (No Match)	No similar game found	Characteristics only	LOW

#	Workflow	Purpose
1	extract.yml	Snowflake → MySQL extraction
2	dq.yml	Data quality + exchange rates + currency normalization
3	mapping.yml	Game key mapping + dimension export + CSV reports
4	weekly.yml	Weekly processing + sequences + basic verification
5	ml.yml	Full ML pipeline with dropdown selection
6	predict_game.yml	Single-game 3-tier prediction
7	predict_regional.yml	Cross-region comparison with job summary
8	full-pipeline.yml	End-to-end chain, scheduled every Sunday 2 AM UTC
9	verify.yml	6 structural integrity checks
10	validate.yml	12 deep accuracy checks
11	dashboard_data.yml	Build dashboard JSON/parquet artifacts + S3 upload
12	deploy.yml	Docker build + deploy to EC2 + health check
13	game_bridge.yml	Qual↔quant name matching with supervisor approval

Document	Lines	Content
PIPELINE_GUIDE.md	1,787	Complete system design: architecture, 23-table DB schema, file-by-file ETL reference
VLT_PIPELINE_GUIDE.md	586	Condensed technical guide: all stages, workflows, Docker, local running
EC2_DEPLOYMENT_GUIDE.md	469	SSH, Docker operations, dev mode, troubleshooting, AWS credentials
INTEGRATION_PLAN.md	762	7-phase build plan covering IGT Sections A–I
XAI_INTEGRATION_GUIDE.md	770	API documentation, data schemas, join keys, error handling
similarity_academic_foundations.md	368	13 academic papers mapped to design decisions and code locations

What I'm up to

AI & Tech Team Lead

UNB Study Term

GSSoC 2026

BuildersLab

Things I've actually shipped

CalgaryPulse

InsightEngine

TravCan: AI Travel Platform

Monitor Lizard

NyxLink

Kaashvi

NightShade

CarbonLedger

A short resume in motion

Open Source Contributor

ML Engineer, Cohort Member

AI & Tech Team Lead

AI/MLOps Engineer Intern

Teaching Assistant, Calculus

A bit about me

Black cat era

17hrs/day in headphones

Caffeine + anxiety

Saving for a motorbike

Things I've written about

InsightEngine: Building a Full-Stack ML Pipeline for VLT Game Prediction at IGT

Monitor Lizard: Using OpenClaw for the First Time, I Loved It

Let's build somethingweird together.

★ 101 REASONS TO LIVE ★

☆ TRAVEL ☆

☆ ADVENTURE ☆

☆ LEARN ☆

☆ EXPERIENCES ☆

☆ HEART ☆

♡ VIBES & PINTEREST BOARD ♡

📌 my pinterest

POLITICS TO LIFELONG FRIENDSHIP

RASH DRIVING

MUSIC IS EVERYTHING

InsightEngine

Project Background

What is InsightEngine?

Technical Architecture

Technology Stack

Data Coverage

ETL Pipeline — Data Ingestion & Transformation

Stage 1: Snowflake Extraction

Stage 2: Exchange Rate Loading

Stage 3: Data Quality & Currency Normalization

Stage 4: Game Key Mapping

Stage 5: Weekly Processing

Stage 6: Sequence Building

Stage 7: Verification

Database Schema

Machine Learning Pipeline — 41 Trained Models

Preprocessing

Clustering

Shape Classification

Model Training

Similarity Engine — Academically-Backed Game Matching

Scoring Algorithm — Weighted Gower Coefficient

Adaptive Weighted KNN

Key Design Decisions (Paper-Backed)

Three-Tier Prediction System

Critical Bug Fix — KENO/BINGO Scoring

Testing & Validation Framework

Unit Tests (60+ tests, 6 files)

Pipeline Verification (18 checks)

CI/CD Automation — 13 GitHub Actions Workflows

Production Deployment on AWS EC2

Docker Containerization

ChromaDB Persistence Challenge

S3 Integration

Dashboard Data Layer & Integration

Key Technical Challenges

Cross-Jurisdiction Data Normalization

Game Name Disambiguation

Time Series Segmentation

Let's build something
weird together.