Fine-Tuning Llama 3.2–3B on IPCC Climate Reports
Finetuning, Pretraining, and building a RAG pipeline using a Llama3.2-3B model on IPCC Climate Reports in a distributed environment
Finetuning, Pretraining, and building a RAG pipeline using a Llama3.2-3B model on IPCC Climate Reports in a distributed environment
Cloud-native ML pipeline with CNN training as a K8s batch job and scalable multi-replica FastAPI inference on GKE
Large-scale data lake pipeline integrating stock, COVID, mobility, and macroeconomic data via HDFS, MapReduce, and Hive/Trino for crash/recovery analysis
Robust, reproducible pipeline designed to fine-tune the DETR (DEtection TRansformer) model for the task of detecting moved objects in pairs of images taking from parking lots and intersections (VIRAT dataset).
Evaluated classical ML model performance and designed light-weight neural networks with custom loss functions for SoTA limit order book midpoint prediction for a cryptocurrency stock
Compact suite of operating system simulations in modern C++ spanning - linking/loading, CPU scheduling, memory paging, and disk I/O scheduling
C++/CUDA inference engine achieving 6.2× throughput via GPU-assisted scheduling on top of llama.cpp, without modifying model kernels
1st Place HackNYU 2025 (MLH) — Mobile app that scans clothing items and returns Eco-Scores with AI-powered greener alternatives
Two-tower neural recommender with cold-start handling, hot model reloading, and containerized FastAPI inference (<100ms latency)
LangGraph-orchestrated agentic interview system with voice interaction, document analysis, RAG over Reddit cases, and personalized PDF evaluations
ResNet50V2-based EMNIST classifier achieving 93.55% test accuracy with mixed precision and optimized data pipeline
Published in 20th International Conference on Security and Cryptography, SECRYPT 2023, 2023
We propose a context-aware behavioral fingerprinting of IoT devices that takes into account the circumstances or contexts under which the devices are operating. Our fingerprinting strategy uses supervised learning for classifying the IoT devices.Finally, Experimental results show that our fingerprinting technique is quite effective and is capable of identifying IoT devices with more than 94% accuracy.
Recommended citation: Prasad, A.; Biju, K.; Somani, S. and Mitra, B. (2023). Context-Aware Behavioral Fingerprinting of IoT Devices via Network Traffic Analysis. 20th International Conference on Security and Cryptography, SECRYPT 2023.
Download Paper | Download Bibtex
Published in Balisage: The Markup Conference 2023 — Balisage Series on Markup Technologies, Vol. 28, 2023
We introduce an early benchmark (Auto-Markup BenchMark) for evaluating automatic markup engines and propose XATER (XML Translation Edit Rate) alongside a validation-error metric to standardize comparisons across tools and tasks.
Recommended citation: Prescod, P.; Feuer, B.; Hladkyi, A.; Paulk, S.; Prasad, A. (2023). Auto-Markup BenchMark: towards an industry-standard benchmark for evaluating automatic document markup. Proceedings of Balisage: The Markup Conference 2023, Balisage Series on Markup Technologies, Vol. 28.
Download Paper | Download Bibtex
Architected multi-stage RAG pipelines and embedding infrastructure for 5.7M+ documents; reduced retrieval latency from 380ms to 130ms.
Built a metric-extraction + ML pipeline to flag bug-prone Java files with >91% accuracy; informed Agile policy updates to reduce post-release defects.
Built a traffic-analysis pipeline and models that identify IoT devices with >94% accuracy; published at SECRYPT 2023.
Benchmarking ML algorithms for image-change detection tasks, focused on identifying objects removed from vending machines.
Published:
This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
Vision Meets Machine Learning, New York University, Courant Institute of Mathematical Sciences, 2025
Course Assistant and Grader for the Vision Meets Machine Learning course under Prof. Davi Geiger.
Building AI-driven data enrichment pipelines, MCP tool integrations, and embedding infrastructure for an AI-native data platform.
Built marketing analytics and ETL pipelines for large scale data migrations; automated deployments; eliminated major outsourcing spend.
Developed analytical dashboards, OCR, and an early RAG assistant to automate ops and reduce drop-offs.