Data Scientist & ML Engineer

Hi, I'm Kaivalya Dixit

MS in Data Science

A passionate Data Scientist and ML Engineer with expertise in Python, Machine Learning, and Data Analysis. Currently pursuing my Master's in Data Science at NJIT.

Download Resume

About Me

Who I Am

I am a Data Science graduate student at New Jersey Institute of Technology (NJIT) with a strong focus on high-performance computing, distributed systems, and machine learning. My experience spans from developing enterprise-scale ETL pipelines to managing HPC infrastructure and GPU computing environments.

Currently working as an HPC User Support Specialist, I help researchers optimize their computational workflows and manage advanced GPU clusters. My technical expertise includes distributed machine learning, data engineering, and system administration, with a particular interest in building scalable AI solutions and high-performance computing infrastructure.

Fun Facts

HPC Expertise

Specialized in GPU computing and high-performance computing infrastructure

Data Engineering

Built enterprise-scale ETL pipelines and data quality systems

Research & Development

Contributing to cutting-edge ML and distributed computing projects

Technical Skills

Python

Java

JavaScript

C/C++

HTML/CSS

Shell Scripting

LaTeX

TypeScript

PyTorch

TensorFlow

Scikit-learn

Keras

XGBoost

OpenMP

MPI

CUDA

LangChain

LlamaIndex

OpenAI Gym

NLP

Deep Learning

Neural Networks

Transformers

CNNs

RNNs (LSTM/GRU)

Transfer Learning

Ensembling

Multimodal Learning

AutoML

Time Series Analysis

Reinforcement Learning

SQL

PySpark

Hadoop

Pandas

NumPy

MapReduce

Hive

ETL/ELT Pipelines

Data Warehousing

Data Lake

Feature Engineering

Statistical Analysis

Big Data

Tableau

Matplotlib

Seaborn

Plotly

AWS

Docker

Apptainer

Git/GitHub

Databricks

MLflow

Weights & Biases

CI/CD

Containerization

React

Next.js

Node.js

Flask

Streamlit

Spring Boot

REST APIs

Tailwind CSS

Bootstrap

SLURM

PostgreSQL

MySQL

MongoDB

Redis

Data Modeling

Database Design

Query Optimization

Experience & Education

Experience

Mar 2024 - May 2025

High Performance Computing

HPC User Support Specialist, Student Intern

Newark, NJ

Support 400+ researchers with GPU/CPU performance tuning, containerization, and environment troubleshooting Developed an automated benchmark suite for node health using Slurm, Bash, and Python Provisioned 6 NVIDIA Grace Hopper nodes with InfiniBand, managing hardware setup and driver configurations

SlurmBashPythonGPUInfiniBandContainerization

Experience

Mar 2024 - May 2025

High Performance Computing

HPC User Support Specialist, Student Intern

Newark, NJ

SlurmBashPythonGPUInfiniBand+1

Education

Jan 2024 - May 2025

Master of Science in Data Science

New Jersey Institute of Technology

GPA: 3.95/4.0

Newark, NJ

Pursuing advanced studies in machine learning, data engineering, and statistical modeling

PythonMachine LearningData EngineeringStatistics

Education

Jan 2024 - May 2025

Master of Science in Data Science

New Jersey Institute of Technology

GPA: 3.95/4.0

Newark, NJ

Pursuing advanced studies in machine learning, data engineering, and statistical modeling

PythonMachine LearningData EngineeringStatistics

Experience

Jan 2023 - Dec 2023

Dassault Systems

Data Analyst Intern

Pune, India

Engineered Java-based ETL pipeline for Conversion Admin Service, processing enterprise-scale customer lifecycle data Designed interactive dashboard for license conversion tracking using internal visualization frameworks Automated data quality checks streams using SQL window functions and constraint validation

JavaETLSQLData VisualizationData Quality

Experience

Jan 2023 - Dec 2023

Dassault Systems

Data Analyst Intern

Pune, India

JavaETLSQLData VisualizationData Quality

Certification

2022

Kaggle Introduction to Machine Learning

Kaggle

Online

Completed comprehensive machine learning course covering supervised learning algorithms

PythonScikit-learnPandasMachine Learning

View certificate

Certification

2022

Kaggle Introduction to Machine Learning

Kaggle

Online

Completed comprehensive machine learning course covering supervised learning algorithms

PythonScikit-learnPandasMachine Learning

View certificate

Education

Aug 2019 - May 2023

Bachelor of Technology in Electrical and Electronics Engineering

Mahindra Ecole Centrale

GPA: 3.6/4.0

Hyderabad, India

Comprehensive engineering program with focus on electrical systems, electronics, and programming

C/C++MATLABCircuit DesignSignal Processing

Education

Aug 2019 - May 2023

Bachelor of Technology in Electrical and Electronics Engineering

Mahindra Ecole Centrale

GPA: 3.6/4.0

Hyderabad, India

Comprehensive engineering program with focus on electrical systems, electronics, and programming

C/C++MATLABCircuit DesignSignal Processing

Featured Projects

FairShare

Smart bill splitting app for roommates. Upload receipts or PDFs, AI extracts items using Ollama/Perplexity, each person selects what they had, and penny-perfect math calculates who owes what. Features shareable links, lock/unlock bills, and a dashboard for bill history.

Next.jsTypeScriptPrismaTailwind CSSOllamaDocker

Ziggy Bot

Browser-based AI companion chatbot with intelligent task management and NLP capabilities. Features a hybrid AI architecture combining Perplexity Pro API with local LLM deployment, mobile-first PWA design, and Docker containerization for seamless deployment.

TypeScriptReactMongoDBNode.jsDockerLLM

Quantitative Portfolio Simulator

Comprehensive stock portfolio simulation system implementing multiple trading strategies (Buy High, Buy Low, Buy Rising). Features dividend detection, mark-to-market calculations, weekly rebalancing, and performance benchmarking against a custom high-tech index.

PythonPandasNumPyMatplotlibJupyterFinance

Blog

Insights and thoughts on data science, machine learning, and AI

Size vs. Compression Decoding the Future of Efficient AI – SLMs vs. Quantized Models

2025-10-06

Efficient_AISmall_Language_Models

Size vs. Compression Decoding the Future of Efficient AI – SLMs vs. Quantized Models

Small Language Models (SLMs) and Quantized Models offer two distinct but complementary paths to making AI more efficient—SLMs by designing smaller, leaner architectures and Quantized Models by compressing existing large models—both aiming to reduce resource demands without sacrificing performance. This contrast drives the future of deploying powerful AI on resource-limited devices.

Read Article

How Flow Garden Beat Browser Tab Throttling to Deliver Reliable Focus Timers

2025-09-01

blogproject

How Flow Garden Beat Browser Tab Throttling to Deliver Reliable Focus Timers

Discover how Flow Garden uses Web Workers, persistent timers, and smart recovery logic to overcome browser tab throttling and keep Pomodoro timers accurate—even when the app is running in the background.

Read Article

Meet Ziggy: How I Transformed My Old Gaming Laptop Into a Powerful Home Server

2025-07-27

serverubuntu

Meet Ziggy: How I Transformed My Old Gaming Laptop Into a Powerful Home Server

Meet Ziggy: a repurposed Lenovo Legion Y530 turned home server running Docker, CasaOS, and local AI, 11 days uptime and cost‑effective self‑hosting.

Read Article

Occam's Razor in Machine Learning: Why Simpler Systems Win in the Real World

2025-05-29

Machine LearningData Science

Occam's Razor in Machine Learning: Why Simpler Systems Win in the Real World

In machine learning, simplicity isn't just elegant—it's often the difference between a system that works in production and one that never leaves the lab.

Read Article

Hi, I'm Kaivalya Dixit

MS in Data Science

About Me

Who I Am

Fun Facts

HPC Expertise

Data Engineering

Research & Development

Technical Skills

Experience & Education

High Performance Computing

HPC User Support Specialist, Student Intern

High Performance Computing

HPC User Support Specialist, Student Intern

Master of Science in Data Science

New Jersey Institute of Technology

Master of Science in Data Science

New Jersey Institute of Technology

Dassault Systems

Data Analyst Intern

Dassault Systems

Data Analyst Intern

Kaggle Introduction to Machine Learning

Kaggle

Kaggle Introduction to Machine Learning

Kaggle

Bachelor of Technology in Electrical and Electronics Engineering

Mahindra Ecole Centrale

Bachelor of Technology in Electrical and Electronics Engineering

Mahindra Ecole Centrale

Featured Projects

FairShare

Ziggy Bot

Quantitative Portfolio Simulator

Blog

Size vs. Compression Decoding the Future of Efficient AI – SLMs vs. Quantized Models

How Flow Garden Beat Browser Tab Throttling to Deliver Reliable Focus Timers

Meet Ziggy: How I Transformed My Old Gaming Laptop Into a Powerful Home Server

Occam's Razor in Machine Learning: Why Simpler Systems Win in the Real World

Get In Touch