// work
transformer-servex | Production KV cache optimization · MoE routing · IO-aware attention for long-context LLMs
cuda-netopt | ML-driven TCP/UDP packet scheduling · DQN network routing · CUDA queue scoring
AeroMimic | Behavior cloning from expert pilots · real-time MAV autonomy · onboard inference stack
aerosurrogate-control-stack | CFD surrogate modeling · constrained optimization · robustness replacing FEM solvers
// research
satellite telemetry anomaly detection | 100K telemetry readings · 5 NASA/ESA fault modes · recurrence-plot CV · 0.91 F1 on Kepler-class wheel oscillation | PDF · repo
bell labs ml impact analysis | 71-paper corpus · semantic clustering · co-authorship networks · Gradient Boosting AUC 0.674 · SHAP attribution | PDF · repo
// open source
NVIDIA/cuda-python#2087 | FIPS-safe hashes for program cache keys
NVIDIA/cuda-quantum#4688 | nvqpp: discriminate measured-register bool iteration
huggingface/accelerate#4054 | Aggregate profiler memory example
Dao-AILab/flash-attention#2622 | weights_only=True across all torch.load sites
ai-dynamo/dynamo#10281 | HTTP 415 for unsupported image formats
linkedin/Liger-Kernel#1157 | Guard save_for_backward on grad_bias in fused linear CE
// stack
// metrics


