Data Scientist & ML Engineer

Hi, I'm Kaivalya Dixit

MS in Data Science

A passionate Data Scientist and ML Engineer with expertise in Python, Machine Learning, and Data Analysis. Currently pursuing my Master's in Data Science at NJIT.

Download Resume
Profile

About Me

Who I Am

I am a Data Science graduate student at New Jersey Institute of Technology (NJIT) with a strong focus on high-performance computing, distributed systems, and machine learning. My experience spans from developing enterprise-scale ETL pipelines to managing HPC infrastructure and GPU computing environments.

Currently working as an HPC User Support Specialist, I help researchers optimize their computational workflows and manage advanced GPU clusters. My technical expertise includes distributed machine learning, data engineering, and system administration, with a particular interest in building scalable AI solutions and high-performance computing infrastructure.

Fun Facts

HPC Expertise

Specialized in GPU computing and high-performance computing infrastructure

Data Engineering

Built enterprise-scale ETL pipelines and data quality systems

Research & Development

Contributing to cutting-edge ML and distributed computing projects

Technical Skills

Python
Java
JavaScript
C/C++
HTML/CSS
Shell Scripting
LaTeX
TypeScript
PyTorch
TensorFlow
Scikit-learn
Keras
XGBoost
OpenMP
MPI
CUDA
LangChain
LlamaIndex
OpenAI Gym
NLP
Deep Learning
Neural Networks
Transformers
CNNs
RNNs (LSTM/GRU)
Transfer Learning
Ensembling
Multimodal Learning
AutoML
Time Series Analysis
Reinforcement Learning
SQL
PySpark
Hadoop
Pandas
NumPy
MapReduce
Hive
ETL/ELT Pipelines
Data Warehousing
Data Lake
Feature Engineering
Statistical Analysis
Big Data
Tableau
Matplotlib
Seaborn
Plotly
AWS
Docker
Apptainer
Git/GitHub
Databricks
MLflow
Weights & Biases
CI/CD
Containerization
React
Next.js
Node.js
Flask
Streamlit
Spring Boot
REST APIs
Tailwind CSS
Bootstrap
SLURM
PostgreSQL
MySQL
MongoDB
Redis
Data Modeling
Database Design
Query Optimization

Experience & Education

Experience
Mar 2024 - May 2025

High Performance Computing

HPC User Support Specialist, Student Intern

Newark, NJ

Support 400+ researchers with GPU/CPU performance tuning, containerization, and environment troubleshooting Developed an automated benchmark suite for node health using Slurm, Bash, and Python ...and 1 more achievements

SlurmBashPythonGPUInfiniBand+1
Education
Jan 2024 - May 2025

Master of Science in Data Science

New Jersey Institute of Technology

GPA: 3.95/4.0
Newark, NJ

Pursuing advanced studies in machine learning, data engineering, and statistical modeling

PythonMachine LearningData EngineeringStatistics
Experience
Jan 2023 - Dec 2023

Dassault Systems

Data Analyst Intern

Pune, India

Engineered Java-based ETL pipeline for Conversion Admin Service, processing enterprise-scale customer lifecycle data Designed interactive dashboard for license conversion tracking using internal visualization frameworks ...and 1 more achievements

JavaETLSQLData VisualizationData Quality
Certification
2022

Kaggle Introduction to Machine Learning

Kaggle

Online

Completed comprehensive machine learning course covering supervised learning algorithms

PythonScikit-learnPandasMachine Learning
View certificate
Education
Aug 2019 - May 2023

Bachelor of Technology in Electrical and Electronics Engineering

Mahindra Ecole Centrale

GPA: 3.6/4.0
Hyderabad, India

Comprehensive engineering program with focus on electrical systems, electronics, and programming

C/C++MATLABCircuit DesignSignal Processing

Blog

Insights and thoughts on data science, machine learning, and AI

Size vs. Compression Decoding the Future of Efficient AI – SLMs vs. Quantized Models
2025-10-06
Efficient_AISmall_Language_Models

Size vs. Compression Decoding the Future of Efficient AI – SLMs vs. Quantized Models

Small Language Models (SLMs) and Quantized Models offer two distinct but complementary paths to making AI more efficient—SLMs by designing smaller, leaner architectures and Quantized Models by compressing existing large models—both aiming to reduce resource demands without sacrificing performance. This contrast drives the future of deploying powerful AI on resource-limited devices.

Read Article
How Flow Garden Beat Browser Tab Throttling to Deliver Reliable Focus Timers
2025-09-01
blogproject

How Flow Garden Beat Browser Tab Throttling to Deliver Reliable Focus Timers

Discover how Flow Garden uses Web Workers, persistent timers, and smart recovery logic to overcome browser tab throttling and keep Pomodoro timers accurate—even when the app is running in the background.

Read Article
Meet Ziggy: How I Transformed My Old Gaming Laptop Into a Powerful Home Server
2025-07-27
serverubuntu

Meet Ziggy: How I Transformed My Old Gaming Laptop Into a Powerful Home Server

Meet Ziggy: a repurposed Lenovo Legion Y530 turned home server running Docker, CasaOS, and local AI, 11 days uptime and cost‑effective self‑hosting.

Read Article
Occam's Razor in Machine Learning: Why Simpler Systems Win in the Real World
2025-05-29
Machine LearningData Science

Occam's Razor in Machine Learning: Why Simpler Systems Win in the Real World

In machine learning, simplicity isn't just elegant—it's often the difference between a system that works in production and one that never leaves the lab.

Read Article

Get In Touch