I'm Om Salunke

A versatile software engineer specializing in building end-to-end ML pipelines, scalable cloud architectures, and robust data processing systems. Based in Buffalo, NY.

Technical Arsenal

>Programming Languages

Python
Java
C/C++
SQL
R
MATLAB
Golang
Solidity
Bash

>AI & Machine Learning

TensorFlow
Keras
PyTorch
XGBoost
pandas
scikit-learn
BERT
LSTM
Transformers
YOLO

>Web & Frameworks

React
Redux
Django
Spring Boot
Web3.js

>Cloud & DevOps

AWS
GCP
Azure
Docker
Kubernetes
GitHub Actions
Terraform
Databricks

>Data & Big Data

Databricks
Apache Kafka
Delta Lake
Snowflake
AWS S3
Apache Spark

Experience Timeline

Professional journey and contributions

Research Assistant

University at Buffalo

Buffalo, NY
March 2026 - Present
  • Used Python, AWS Bedrock, Excel, and NLP to analyze and process complex medical data, implementing HL7 standards to ensure accurate data formatting and seamless EHR integration.
  • Developed complex SQL queries to extract and analyze EDI transaction data and patient records from Oracle and AWS S3 sources, centralizing refined data in Snowflake.
  • Architected a secure ETL workflow utilizing Databricks Delta Lake and Unity Catalog to extract, govern, and analyze EDI data while conducting HIPAA security risk assessments.

Software Engineer Intern

Tech Mahindra

Mumbai, India
December 2022 - June 2023
  • Engineered robust ETL pipelines using PySpark and Azure Databricks to ingest, clean, and transform large-scale datasets, significantly reducing data processing time.
  • Designed a scalable Lakehouse architecture using Delta Lake and Azure Data Lake Storage Gen2, optimizing data storage efficiency and ensuring ACID compliance.
  • Configured and managed scalable compute resources in Azure Databricks, including clusters and cluster pools, to optimize performance and reduce costs.
  • Implemented comprehensive data governance and access control solutions using Unity Catalog within the Databricks workspace.

Featured Projects

Showcasing innovative solutions and technical expertise

Ripple - AI Policy Simulator

A highly scalable data pipeline and simulation engine using Python, AWS, and Databricks for Monte Carlo simulations and predictive economic modeling.

Technologies

Python, AWS, Databricks, PySpark, ReactJS, Google Gemini API

Highlights

  • Scalable data pipeline
  • Interactive visualization dashboard
  • Monte Carlo simulations
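
For readers unfamiliar with the technique, here is a minimal sketch of a Monte Carlo simulation in plain Python. It is not Ripple's actual code: the compounding-growth model, the function names, and every parameter (`baseline`, `mean_growth`, `growth_std`) are hypothetical and chosen only to illustrate the idea of sampling many random scenarios and summarizing the spread of outcomes.

```python
import random
import statistics


def simulate_policy_outcomes(baseline, mean_growth, growth_std, years, n_trials, seed=0):
    """Run n_trials random scenarios of compounded annual growth.

    Hypothetical model: each year's growth rate is drawn from a normal
    distribution with the given mean and standard deviation.
    Returns the list of final values, one per trial.
    """
    rng = random.Random(seed)  # seeded generator so runs are reproducible
    finals = []
    for _ in range(n_trials):
        value = baseline
        for _ in range(years):
            value *= 1.0 + rng.gauss(mean_growth, growth_std)
        finals.append(value)
    return finals


def summarize(finals):
    """Return the mean and rough 5th/95th percentiles of the simulated outcomes."""
    ordered = sorted(finals)
    n = len(ordered)
    return {
        "mean": statistics.fmean(ordered),
        "p05": ordered[int(0.05 * (n - 1))],
        "p95": ordered[int(0.95 * (n - 1))],
    }


if __name__ == "__main__":
    outcomes = simulate_policy_outcomes(
        baseline=100.0, mean_growth=0.02, growth_std=0.01, years=5, n_trials=2000
    )
    print(summarize(outcomes))
```

The percentile band, rather than a single point estimate, is what makes this kind of simulation useful for comparing scenarios under uncertainty.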

Citi Bike Trip Prediction System

End-to-end ML pipeline forecasting hourly NYC Citi Bike demand using Python, Pandas, and LightGBM with MLflow experiment tracking.

Technologies

Python, LightGBM, Pandas, MLflow, Databricks, Streamlit

Highlights

  • Time-series forecasting
  • MLOps lifecycle
  • Batch prediction scheduling
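
One common way to frame hourly demand forecasting for a tree-based model is to turn the series into lagged features. The sketch below shows that framing in plain Python so it stays self-contained. It is an illustration under assumptions, not the project's pipeline: the function name and the choice of lags (previous hour, same hour yesterday, same hour last week) are hypothetical.

```python
def make_lag_features(hourly_counts, lags=(1, 24, 168)):
    """Turn an hourly count series into (features, target) training rows.

    For each hour t with enough history, the features are the counts
    observed `lag` hours earlier and the target is the count at hour t.
    The default lags are an assumption made for illustration.
    """
    max_lag = max(lags)
    rows = []
    for t in range(max_lag, len(hourly_counts)):
        features = [hourly_counts[t - lag] for lag in lags]
        rows.append((features, hourly_counts[t]))
    return rows


if __name__ == "__main__":
    # Toy series: 200 hours of made-up counts.
    series = list(range(200))
    rows = make_lag_features(series)
    print(len(rows), rows[0])
```

Rows built this way could then be passed to any regression model. Only past values appear in each feature vector, which avoids leaking the target hour into its own inputs.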

Education & Credentials

education@om.dev
$ ./education.sh
> degree: Master's in Artificial Intelligence
> institution: University at Buffalo, NY
> completion_date: December 2025
> relevant_coursework:
- Database Systems (CSE562)
- Applied Machine Learning (CDA500)
- Data Intensive Computing (CSE587)
$ ✓ completed
education@om.dev
$ ./education.sh
> degree: B.E. in Computer Science and Engineering (Data Science)
> institution: University of Mumbai, India
> completion_date: May 2024
$ ✓ completed

Publications

publications@om.dev
$ ./publications.sh
> title: "Leveraging AI to Enhance Doctor Availability in Hospitals"
> conference: ICLMU - IJCA 2023
> read_paper: https://www.ijcaonline.org/proceedings/llmuc2023/number2/leveraging-ai-to-enhance-doctor-availability-in-hospitals/
$ ✓ published

Certifications & Badges

badges@om.dev
$ ./verify_credentials.sh
> Google Developer Badge
> Google Cloud Badges
$ ✓ verified

Let's Connect

Open to exciting opportunities, collaborations, and discussions about AI, cloud architecture, and data engineering.

Send me an Email

Or reach out via any of the contact methods above. I'll get back to you within 24 hours.