Hey! I'm Mani

I'm an AI Engineer

Artificial Intelligence | Machine Learning | Data Mining | Multimodal Generative AI | Backend Engineer

Contact Resume

About

My Introduction
Mani

I am deeply passionate about creating intelligent machines that enhance human capability and accessibility. With a strong background in artificial intelligence, machine learning, and data mining, I enjoy tackling complex challenges and transforming them into practical, human-centric solutions. My experience spans natural language processing, retrieval-augmented generation, and vector databases, where I focus on building systems that make a real-world impact. Through three internship experiences in the computer science domain and rigorous coursework and project work, I have honed my technical skills and problem-solving abilities. Driven by a commitment to continuous learning and hands-on development, I strive to design intelligent systems that empower individuals and promote inclusivity.

3+ Internships
3.7 GPA
20+ Projects

Skills

My Expertise
AI Agents
LLM Fine-tuning
Generative AI
RAG Systems
Knowledge Graphs
Machine Learning
Deep Learning
NLP
Computer Vision
Data Mining
Data Science
Data Analytics
Python
C++
SQL
Bash
PyTorch
TensorFlow
LangChain
Flask
Django
Hugging Face
Backend
GCP
AWS
Docker
Git

Experience

Professional and Academic Experience
Academic
Professional

Teaching Assistant for CS6220 Data Mining

Khoury College of Computer Sciences, Northeastern University
2025

MS in Artificial Intelligence

Foundations of AI, LLMs, Algorithms, Machine Learning | Northeastern University |Boston, USA
2023-Present

BTech in Computer Science (Spz AI&ML)

Software Engineering, NLP, AI, ML, Data Mining, OOP | UPES | Dehradun, India
2019-2023

AI Research Engineer

SignoFi
Sept 2024 - Jan 2025

Software Engineer - R&D

KEK
Aug 2022 - Aug 2023

Software Engineer Intern - Knowledge Graphs

Jio-Reliance
May 2022 - Jul 2022

Projects

Some of my projects
Clinical AI assistant

ClinAI: MCP based Agentic AI for healthcare Clinical assistant

ClinAI is an agentic AI system that automates the extraction, organization, and retrieval of clinical data from doctor-patient conversations using large language models. Built on the Model Context Protocol (MCP) framework, ClinAI enables seamless coordination between multiple AI agents and tools. It uses Google Gemini for post-session speaker role labeling and intelligent summarization of transcriptions. The workflow involves real-time audio capture, structured note generation, editable record validation, and persistent storage via MongoDB. ClinAI demonstrates the potential of Agentic AI to reduce administrative overhead, ensure data traceability, and deliver scalable, privacy-conscious healthcare automation.

GitHub
Machine Unlearning

Machine Unlearning: Erasing concepts from LLMs

In this project, we aim to enable large language models to selectively forget specific information without requiring full retraining. We compare three different approaches — LoRA-based fine-tuning, guardrailing using LLaMA-Guard, and Sparse Autoencoder (SAE)-based unlearning — to assess their effectiveness in targeted forgetting. The workflow includes curated dataset preparation, parameter-efficient training, evaluation through perplexity and retention metrics, and runtime safety checks. This work highlights practical, scalable techniques for efficient and legally compliant machine unlearning.

GitHub
BlindSight image

BlindSight: AI-Powered Voice-Driven File Navigation for the Visually Impaired

BlindSight is our innovative approach to making computer systems more accessible and intuitive for individuals with complete or partial visual impairment. Using just a single key, users can navigate the operating system and perform essential tasks like text-file editing and OS manipulation (e.g., changing directories, listing files, creating folders, etc.). Our mission is to foster inclusivity in the tech space and empower individuals with disabilities by providing them with tools for equal representation in the digital world.

GitHub
AI Grading Assistant

AI-Powered Grading Assistant

In this project, we develop an AI-powered grading assistant designed to automate score prediction and feedback generation for short answers. The system fine-tunes RoBERTa for numerical score prediction and BART for detailed feedback generation using parameter-efficient LoRA methods. The training process involves curated dataset preparation, hyperparameter tuning, and evaluation using metrics such as RMSE, Pearson correlation, and ROUGE scores. This work demonstrates how fine-tuned transformer models can streamline grading tasks by delivering accurate, explainable assessments at scale.

GitHub
RL Game image

Anakin Breaking Bad

Developed a fun game based on Star Wars. Used Reinforcement learning techniques like value iteration, Natural language processing, Text similarity and Pygame. Do check it out and give suggestions!

GitHub
VAE_GAN

DCGAN vs VAE - A comparison for Image Generation

This project explores Deep Convolutional Generative Adversarial Networks (DC- GANs) and Variational Autoencoders (VAEs) by developing and comparing them with the CIFAR-10 dataset. Utilizing the PyTorch library, these models are built from scratch to generate and reconstruct images. The study evaluates their perfor- mance using the Inception score (IS) and the Fréchet Inception Distance (FID).

View Paper
Memory icon

Memory Companion

MemoryCompanion is a tool for people suffering with Mild Cognitive Impairment (MCI), a condition in which users face memory or thinking related problems but not to the point that it affects their daily activities. These people tend to miss social events, forget appointments and have issues with remembering names or conversations with other people. MemoryCompanion helps these people keep track of conversations, names, tasks to do and much more!

GitHub
Stock bot icon

Surface Crack Detection

Built a surface crack detection web-app with a lightweight custom CNN to help civil inspectors inspect concrete surface quality in earthquake prone areas.

GitHub

Agent Navigation using reinforcement algorithms (comparative study)

Reinforcement learning has been widely used as a learning mechanism for an artificial life system. I built a project that compares path planning algorithms like Q- learning, SARSA, DQL (Path planning via deep reinforcement learning is an end-to-end method) in terms of optimality and time taken. I also visualize the path planned by each one of them using simple graphics.

GitHub

Want to collaborate?

Feel free to reach out if you're interested in collaborating on projects or research, particularly in the fields of AI, Data Science, ML, GenAI, or related areas!

Contact
Web Development vector art taken from internet symbolizing services that I do. Credits to original author

Blog

Explore My Medium Articles

Stay updated with my thoughts, insights, and research in AI, Data Science, ML, and more on Medium!

Visit My Blog

Contact

Get in touch with me

Email

mani.srinivasan2k1@gmail.com

Location

Boston, USA
×

AI Research Engineer Intern

At SignoFi in Raleigh, USA (Sep 2024 – Jan 2025), I worked as an AI Research Engineer Intern where I pioneered an Agentic AI application leveraging Llama 70B and LangChain to retrieve balance sheet data from NoSQL databases, enabling interactive queries and computational analysis for over 50,000 public companies. I also applied Retrieval-Augmented Generation (RAG) and fine-tuning with adaptive chunking strategies to ground Llama 70B and 90B, boosting the Faithfulness score to 0.84 (84%) and enhancing contextual precision, recall, and relevancy (92%) in factual question answering. In addition, I architected a custom workflow on Google Cloud Platform (GCP) using virtual machines, Vertex AI, GCP Buckets, and multi-core GPUs to optimize operations at a portfolio management firm, achieving up to 20% in cost savings.

×

Grid Computing Developer

Previously, at KEK in Tsukuba, Japan (Oct 2022 – Sep 2023), I served as a Grid Computing Developer for the prestigious BELLE-2 particle acceleration experiment. This international collaboration involved world-renowned laboratories such as Brookhaven, CERN, KEK, and over 127 institutions including Carnegie Mellon. I engineered innovative software tools that enabled physicists to execute experiments on over 10 PB of distributed data across global data centers. Additionally, I streamlined more than 50 issues on GitLab and JIRA by overhauling complex codebases and working closely with engineers from around the world to improve research efficiency and scalability.

×

Software Engineer Intern

Earlier, during my time at Jio in Mumbai, India (Jun 2022 – Nov 2022), I interned as a Software Engineer in the Center of Excellence for AI & ML. I contributed to the development of the BrainOS platform, integrating 7+ verticals across Reliance Industries using knowledge graphs and predictive machine learning, achieving 86% accuracy and 0.79 precision across products like JioTV, Hydrocarbons, and JioMart. I built robust knowledge graphs by preprocessing raw data and implemented over 100 CRUD API functions using gRPC to efficiently query extensive NoSQL databases, using debugging tools to refine 1000+ functions. As part of a cross-functional initiative, I also helped develop a beta version of a user-user collaborative filtering recommendation system using Neo4j and custom graph functions, contributing to a 10% revenue boost for Jio Platforms.

×

Teaching Assistant - Data Mining

As a Graduate Teaching Assistant for CS6220 Data Mining under Professor Dr. Sara Arunagiri, I was responsible for validating new assignment sets to ensure comprehensive coverage of key data mining concepts and coding practices. I actively engaged with students by answering technical queries on Microsoft Teams and Piazza, conducted office hours to provide in-depth explanations on a range of topics including classification, clustering, association rule mining, and data preprocessing, and played a key role in grading assignments and projects with a focus on evaluating both conceptual understanding and practical implementation. This role strengthened my expertise in data mining methodologies, algorithmic application, and student mentorship.

×

MS in Artificial Intelligence - Northeastern University

×

BTech in Computer Science - UPES

Skills: Creative Problem Solving · Algorithms · Machine Learning · C++ · Grid Computing · Java · Distributed Systems · Artificial Intelligence (AI) · Jira · Data Analysis · Object-Oriented Programming (OOP) · Supervised Learning · Data Structures · Natural Language Processing (NLP) · Software Development · Oral Communication · Python (Programming Language) · Databases