dipak0000812/README.md

Portfolio LinkedIn Gmail LeetCode


Second-year AIML engineering student. I spend most of my time on backend systems - mainly Go and Python - and I'm trying to get better at the stuff that's hard to learn from tutorials: concurrency bugs, retry logic, database locking, that kind of thing. I also have some background in ML (scikit-learn, LSTMs, basic NLP with spaCy) from coursework and a couple of projects.


Projects

Orchestrix — Job Orchestration Engine

Go PostgreSQL Docker Prometheus

A job orchestration engine I built to actually run into and fix concurrency problems instead of just reading about them. It has a worker pool, a job state machine (PENDING → SCHEDULED → RUNNING → SUCCEEDED / FAILED / RETRYING), retries with exponential backoff, and a graceful shutdown that drains in-flight jobs.
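A minimal sketch of the state machine and backoff described above. It is written in Python to keep it short, while Orchestrix itself is in Go. The transition table is an assumption (in particular the `RETRYING -> SCHEDULED` edge), and the `base` and `cap` values are placeholders rather than the project's real settings.

```python
from enum import Enum


class State(Enum):
    PENDING = "PENDING"
    SCHEDULED = "SCHEDULED"
    RUNNING = "RUNNING"
    SUCCEEDED = "SUCCEEDED"
    FAILED = "FAILED"
    RETRYING = "RETRYING"


# Allowed transitions. The RETRYING -> SCHEDULED edge is an assumption about
# how a retried job re-enters the pipeline.
TRANSITIONS = {
    State.PENDING: {State.SCHEDULED},
    State.SCHEDULED: {State.RUNNING},
    State.RUNNING: {State.SUCCEEDED, State.FAILED, State.RETRYING},
    State.RETRYING: {State.SCHEDULED},
    State.SUCCEEDED: set(),  # terminal
    State.FAILED: set(),     # terminal
}


def transition(current: State, target: State) -> State:
    """Return target if the move is legal, otherwise raise ValueError."""
    if target not in TRANSITIONS[current]:
        raise ValueError(f"illegal transition {current.value} -> {target.value}")
    return target


def backoff_delay(attempt: int, base: float = 1.0, cap: float = 60.0) -> float:
    """Seconds to wait before retry number `attempt` (0-based): base * 2**attempt, capped."""
    return min(cap, base * (2 ** attempt))
```

Rejecting illegal transitions in one place means a worker cannot, for example, move a finished job back to RUNNING by accident. The cap keeps late retries from waiting unboundedly long.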

A few specific bugs I hit and how I fixed them:

  • Duplicate job pickup: when I ran multiple scheduler instances, they'd sometimes grab the same job from the DB at the same time. Fixed it with SELECT FOR UPDATE SKIP LOCKED inside a transaction so each instance claims a non-overlapping set of jobs.
  • Jobs retrying for no reason: if a job's executor type wasn't registered, the job would keep retrying until it hit the max attempts and then fail silently - no useful error, just wasted cycles. I split errors into "retryable" vs "permanent" so missing executors and panics fail immediately instead of looping.
  • Import cycle: adding Prometheus metrics to the worker package created a cycle between worker and api. Pulled the metrics into their own package that both import separately.
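The first two fixes can be sketched roughly as follows, again in Python rather than the project's Go. The `jobs` table and its `id`, `state`, and `created_at` columns are hypothetical, as are `claim_jobs`, `run_once`, and the `EXECUTORS` registry. `conn` is assumed to be a DB-API connection to PostgreSQL using `%s` placeholders. Go panics have no direct Python analogue, so only the missing-executor case is shown.

```python
# Hypothetical schema: jobs(id, state, created_at).
# Rows already locked by another scheduler are skipped instead of waited on,
# so concurrent instances claim disjoint sets of jobs.
CLAIM_SQL = """
UPDATE jobs
SET state = 'SCHEDULED'
WHERE id IN (
    SELECT id FROM jobs
    WHERE state = 'PENDING'
    ORDER BY created_at
    LIMIT %s
    FOR UPDATE SKIP LOCKED
)
RETURNING id
"""


def claim_jobs(conn, limit: int) -> list:
    """Claim up to `limit` pending jobs in one transaction and return their ids."""
    cur = conn.cursor()
    cur.execute(CLAIM_SQL, (limit,))
    ids = [row[0] for row in cur.fetchall()]
    conn.commit()
    return ids


class PermanentError(Exception):
    """Failure that retrying cannot fix, e.g. no executor registered for the job type."""


EXECUTORS = {}  # hypothetical registry: job type -> callable taking a payload


def run_once(job_type: str, payload: dict, attempt: int, max_attempts: int) -> str:
    """Run a job once and return the name of its next state."""
    try:
        executor = EXECUTORS.get(job_type)
        if executor is None:
            raise PermanentError(f"no executor registered for {job_type!r}")
        executor(payload)
        return "SUCCEEDED"
    except PermanentError:
        return "FAILED"  # fail immediately, no retry
    except Exception:
        # Anything else is treated as retryable until attempts run out.
        return "RETRYING" if attempt + 1 < max_attempts else "FAILED"
```

Treating unknown exceptions as retryable and listing permanent ones explicitly is one possible default. The opposite default (retry only known-transient errors) is stricter and may suit some workloads better.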

v1 is functional locally with Docker Compose. Not deployed yet - planning to put it on Render and possibly look at swapping the DB-polling scheduler for a Redis or Kafka-backed queue later.


PRISM-AI — PR Risk Analysis Tool

Python FastAPI Next.js Groq

Built for the GitLab AI Hackathon (ZerothLayer). The idea: score how risky a pull request is before a human reviews it, without relying on an LLM to do the actual judgment.

How it works:

  • tree-sitter parses changed files (Python / JS / TS) into ASTs
  • NetworkX builds an import dependency graph and works out what else in the codebase could be affected by the change
  • A set of deterministic checks (PR size, file churn, whether auth/payment code was touched, test coverage delta, dependency depth, how familiar the author is with that part of the codebase) produce a 0-100 risk score
  • An LLM (Llama 3.3 via Groq) takes that score and writes a plain-language summary for the reviewer - it doesn't influence the score itself
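To illustrate the dependency-graph step, here is a rough sketch of finding everything that could be affected by a change. PRISM-AI uses NetworkX for this; the sketch swaps in a plain dict and a breadth-first search so it runs with only the standard library. The `affected_modules` function and the module names in the usage example are made up for illustration.

```python
from collections import deque


def affected_modules(imports: dict, changed: set) -> set:
    """Return modules that directly or transitively import any changed module.

    `imports` maps each module to the set of modules it imports.
    """
    # Invert the graph: module -> modules that import it.
    importers = {}
    for module, deps in imports.items():
        for dep in deps:
            importers.setdefault(dep, set()).add(module)

    # Breadth-first walk up the importer edges, starting from the changed modules.
    seen = set()
    queue = deque(changed)
    while queue:
        current = queue.popleft()
        for parent in importers.get(current, ()):
            if parent not in seen and parent not in changed:
                seen.add(parent)
                queue.append(parent)
    return seen


# Usage with a hypothetical codebase:
imports = {
    "api": {"auth", "db"},
    "auth": {"db"},
    "billing": {"auth"},
    "db": set(),
    "cli": set(),
}
impacted = affected_modules(imports, {"db"})  # api, auth, and billing
```

The size of this set, or the longest chain within it, is one way a "dependency depth" signal could be derived.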

The reasoning behind that order: I wanted the risk number to stay the same regardless of what the LLM outputs, so it's reproducible and you can sanity-check it without trusting the model.
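A sketch of that separation, with the caveat that every weight, cap, and field name below is hypothetical and not taken from PRISM-AI. The point is only the shape: the score is a pure function of the measured signals, and the summarizer (standing in for the LLM call) receives the score but has no way to change it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Signals:
    lines_changed: int
    files_changed: int
    touches_sensitive_code: bool  # auth / payment paths
    coverage_delta: float         # percentage points; negative means coverage dropped
    dependency_depth: int
    author_familiarity: float     # 0.0 (new to this code) to 1.0 (regular contributor)


def risk_score(s: Signals) -> int:
    """Deterministic 0-100 score. All weights and caps here are hypothetical."""
    score = 0.0
    score += min(s.lines_changed / 500, 1.0) * 25
    score += min(s.files_changed / 20, 1.0) * 15
    score += 20 if s.touches_sensitive_code else 0
    score += min(max(-s.coverage_delta, 0.0) / 10, 1.0) * 15
    score += min(s.dependency_depth / 5, 1.0) * 10
    score += (1.0 - s.author_familiarity) * 15
    return round(min(score, 100.0))


def build_report(s: Signals, summarize) -> dict:
    """Compute the score first, then ask `summarize` for prose only."""
    score = risk_score(s)
    return {"score": score, "summary": summarize(score, s)}
```

Because `build_report` fixes the score before calling `summarize`, swapping the model or getting a different completion changes only the `summary` field.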

Working code, runs locally. Haven't deployed it yet.


Open Source

Microcks OpenTelemetry kworkflow Layer5

  • Microcks — API mocking and contract testing; active contributions
  • OpenTelemetry — Contribution under review
  • kworkflow — Linux kernel developer workflow tooling
  • Meshery / Layer5 — Contributor, earned First Design badge
  • Hacktoberfest 2024 — 4 PRs merged

Technical Focus

| Domain | Stack |
| --- | --- |
| Languages | Go · Python |
| Familiar With | C · C++ · Rust |
| Backend | REST APIs · FastAPI · Worker pools · PostgreSQL · Docker · Prometheus |
| ML & AI | Scikit-learn · LSTM · spaCy · Claude API |

Right now

Deploying Orchestrix and PRISM-AI so they're not just "clone and run locally." After that, planning to look at Redis-backed queues for Orchestrix and see how it compares to the current DB-polling approach.

Popular repositories

  1. microcks (Public)

    Forked from microcks/microcks

    The open source, cloud native tool for API Mocking and Testing. Microcks is a Cloud Native Computing Foundation sandbox project 🚀

    Java 2

  2. meshery (Public)

    Forked from meshery/meshery

    Meshery, the cloud native manager

    Go 2

  3. Stock-Price_Prediction (Public)

    An AI-powered stock price forecasting dashboard using LSTM Deep Learning, 23 technical indicators, and real-time Yahoo Finance data via a Streamlit interface.

    Python 1

  4. TrustBite (Public)

    TrustBite is a hyperlocal platform that helps students discover trusted mess and tiffin services using hygiene scores, reviews, and AI-based recommendations.

    JavaScript 1 2

  5. pgmpy (Public)

    Forked from pgmpy/pgmpy

    Python library for Causal AI

    Python 1 1

  6. AIOD-rest-api (Public)

    Forked from aiondemand/AIOD-rest-api

    Services for the core of AIoD: Authentication and the metadata catalogue with REST API.

    Python 1