CV

Arjun Parasuram Prasad

ap9334@nyu.edu

+1-914-336-0944

https://prasadboi.github.io

New York, , US

LinkedIn GitHub

Summary

MS CS @ NYU | SWE Intern @ Corvic AI | ex-Providence, SuperPe

Education

Master of Science in Computer Science
2026-05-01
NYU, Courant Institute of Mathematical Sciences
GPA: 3.85/4
Courses: Machine Learning, Computer Vision, Big Data & ML Systems, GPU Programming, Algorithms, Operating Systems
Bachelor of Engineering (Computer Science)
2023-06-01
BITS Pilani — Hyderabad Campus, India
GPA: 8.65/10
Courses: Deep Learning, NLP, Data Structures & Algorithms, Database Systems, Probability & Statistics

Work Experience

Software Engineer Intern, AI/ML
2026-02-01 -
Corvic AI
Built AI-driven data enrichment pipeline (Web Augment) with Temporal workflows, batched execution, and row-level retry (~2,500 lines across 28 files). Redesigned DataApp embedding API with dynamic dimension resolution via HuggingFace AutoConfig. Implemented full-stack MCP tool integration for AI agent platform (~1,500+ lines across ~20 files).
Data Analytics Engineer
2023-06-01 - 2024-08-01
Providence Health Care
Engineered ETL pipelines using Azure Data Factory for an in-house data migration tool (Ultrasonic), handling 10M+ row datasets across diverse source/target systems. Partnered with marketing stakeholders to build a Snowflake-based analytics framework; integrated advanced stored procedures with Power BI dashboards to internalize campaign KPI tracking, eliminating >$2M in outsourcing costs. Shipped CI/CD pipelines to automate SQL script deployments, reducing manual effort by ~60 hours/month.
Machine Learning Intern
2023-01-01 - 2023-06-01
SuperPe.in
Built analytical dashboards to track merchant onboarding and financial KPIs; identified bottlenecks that reduced drop-offs by 40%. Developed an OCR pipeline to extract/validate merchant document numbers with 97% accuracy (Google Vision + OpenCV), removing manual entry; deployed via AWS Lambda and Amazon EFS. Created a RAG-based conversational agent (LangChain + Milvus) to automate customer query resolution via agent–tool workflows.

Skills

Languages

Python, C++, C, SQL, Java, TypeScript, JavaScript

ML / AI Engineering

PyTorch, Hugging Face, vLLM, DeepSpeed, LangChain, LangGraph, CUDA, OpenCV, scikit-learn, TensorFlow, Search & Ranking, RAG, Vector DBs (Milvus, FAISS), Distributed Training/Inference, LLMs

Data Engineering

Snowflake, PostgreSQL, Polars, PyArrow, HDFS, MapReduce, Hive, Trino, Kafka, Azure Data Factory, Power BI, ETL, CI/CD

Infrastructure & DevOps

Docker, Kubernetes (GKE), AWS (Lambda, S3, EFS), GCP (Vertex AI, Artifact Registry), Singularity, SLURM, gRPC, Protobuf, Temporal, FastAPI, Git, BAML, MCP

Publications

Context-Aware Behavioral Fingerprinting of IoT Devices via Network Traffic Analysis
2023
20th International Conference on Security and Cryptography (SECRYPT 2023)
View Publication
Auto-Markup Benchmark: Towards an Industry-standard Benchmark for Evaluating Automatic Document Markup
2023
Balisage: The Markup Conference 2023
View Publication

Portfolio

CoBaLI – Continuous Batching for LLM Inference
2025
Portfolio
Developed C++/CUDA inference engine on top of llama.cpp for Qwen2.5-0.5B-Instruct on RTX 4070. Implemented sequential serving, continuous batching, and chunked prefill. Achieved 6.2× throughput improvement (154s→25s) via GPU-assisted scheduling without modifying model kernels.
View Project
EcoScan – Sustainable Fashion Scanner (1st Place HackNYU 2025)
2025
Portfolio
Built mobile app scanning clothing for Eco-Scores (0–100) with AI explanations and greener alternatives. React Native/Expo frontend, FastAPI backend, Supabase datastore. Integrated Gemini 2.5-flash, Cloud Vision OCR, and Lykdat API. Won 1st place in HackNYU 2025 Sustainability Track (MLH).
View Project
RepoRecSys – GitHub Repository Recommendation System
2025
Portfolio
Two-tower neural recommender with 64d embeddings, InfoNCE contrastive loss, cold-start handling via Pub/Sub, and hot model reloading from GCS. Containerized FastAPI inference with <100ms median latency.
View Project
Fine-Tuning Llama 3.2–3B on IPCC Climate Reports
2025
Portfolio
Applied 4-bit quantization, BF16, and LoRA (r=8, α=32) with gradient checkpointing. Orchestrated data/tensor/pipeline parallelism across 2×A100s; doubled throughput (337→169 min) and cut per-GPU memory by ~48%.
View Project
MOT-TR — Fine-Tuning DETR for Moved-Object Detection
2025
Portfolio
Fine-tuned DETR for scene-change analysis in image pairs. Reduced eval loss from 2.38 to 0.60 via 3-stage training with progressive unfreezing and cosine annealing.
View Project
Limit Order Book Midpoint Prediction for Crypto
2024
Portfolio
Custom neural-SVR with dynamically scaled ε-insensitive loss and trend-alignment penalty. MSE 28.05, R² 0.998. Benchmarked KNN, SVR, MLP, LSTM, CNN+LSTM, CNN+SVR.
View Project
OS Simulation Suite
2024
Portfolio
Four self-contained OS kernel-concept simulators in C++ covering linking, CPU scheduling (FCFS/SJF/SRTF/RR/Priority), virtual memory (FIFO/Clock/Aging/Working-Set), and disk I/O scheduling (FIFO/SSTF/LOOK/C-LOOK/FLOOK).
View Project
VisaPulse – AI-Powered F-1 Visa Mock Interview
2025
Portfolio
LangGraph state machine orchestrating voice-based mock interviews with RAG over Reddit cases, document analysis, and personalized PDF evaluations via reportlab.
View Project

Download CV as PDF View Markdown CV