Yugam Padha

About Me

I’m Yugam Padha, an AI researcher and engineer passionate about building intelligent systems that solve real-world problems. My work spans machine learning, explainable AI (XAI), and advanced gas-sensing research, where I combine strong theoretical foundations with hands-on engineering to create models that are both powerful and interpretable.

Over the years, I’ve developed end-to-end AI pipelines for industrial gas sensing — from embedded hardware and firmware integration to data acquisition, deep learning modelling, and deployment. I also built SensorLab, a PyQt6-based desktop application that interfaces with STM32 hardware, ADS1220 ADCs, and temperature/humidity sensors, enabling real-time analytics and ML inference.

Beyond applied engineering, I actively research psychological AI, using SHAP, LIME, and hybrid ensemble architectures to create interpretable models that explore human-centered constructs such as depression, personality, and well-being. My published and ongoing research focuses on AI systems that are transparent, reliable, and grounded in domain expertise.

I’m currently preparing for a PhD in Computer Science.

I’m driven by the belief that impactful AI requires a balance of innovation, interpretability, and meaningful application. Whether advancing gas-sensing technologies or designing interpretable psychological models, my goal is to create AI systems that contribute to science, society, and industry.

If my work aligns with your vision, I’d love to connect.

Research Interests

Machine Learning

Edge-AI

Smart Sensor Systems

IoT

Algorithms

Hybrid & Ensemble Learning Architectures

Explainable Artificial Intelligence

AI-Enabled Wireless Systems

Human-Centered AI Systems

Scalable AI Infrastructure

Data Mining

Education

MCA – University of Jammu

Nov 2023 – Oct 2025 | CGPA: 8.8 (Second in Class)

Thesis: Predictive Model for Detection of Psychological Distress Using Machine Learning and Psychometric Analysis.

B.Sc. – Cluster University of Jammu

Jul 2020 – Sep 2023 | CGPA: 6.4

Majors: Physics, Chemistry, Mathematics.

Industrial-Grade Deep Learning for Gas Sensing: Robust Calibration, Domain Adaptation & Real-Time Deployment

Status: Ongoing — data curation, model development & firmware integration


This paper develops an industrial-grade deep learning pipeline for concentration prediction and gas classification from multi-condition sensor streams. The work emphasizes robust calibration under temperature/humidity drift, domain adaptation across sensors and environments, and real-time inference integration with the SensorLab application and embedded STM32 pipeline.

Curating heterogeneous datasets across concentration ranges, gas types, temperatures and humidity; implementing systematic data augmentation and sensor noise models.
Designing a hybrid model (signal-processing front-end + deep regression/classifier backbone) for accurate concentration estimation and gas ID under drift.
Applying domain adaptation & transfer learning to generalize across sensor batches and deployment sites (unsupervised & few-shot strategies).
Building temperature/humidity compensation modules and uncertainty-aware predictions (calibrated confidence intervals for actionable alerts).
Optimizing models for edge deployment (quantization, pruning) and integrating them with SensorLab (PyQt6) and STM32 firmware (CDC serial) for real-time inference and CSV/XLSX export.
Evaluating robustness through adversarial noise, long-term drift tests, and cross-validation across demographic/environmental stratifications where applicable.
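As a rough illustration of the temperature/humidity compensation step listed above, the sketch below fits a linear baseline model on clean-air readings and subtracts it from the raw signal. It is not the pipeline's actual compensation module: the linear form, the function names (`fit_drift_model`, `compensate`), and all data are hypothetical and synthetic.

```python
import numpy as np

def fit_drift_model(temp_c, rh_pct, baseline_signal):
    """Least-squares fit of baseline ~ a + b*T + c*RH on clean-air data.

    Assumption: drift is approximately linear in temperature and humidity.
    """
    X = np.column_stack([np.ones_like(temp_c), temp_c, rh_pct])
    coef, *_ = np.linalg.lstsq(X, baseline_signal, rcond=None)
    return coef

def compensate(signal, temp_c, rh_pct, coef):
    """Subtract the predicted environmental baseline from a raw signal."""
    X = np.column_stack([np.ones_like(temp_c), temp_c, rh_pct])
    return signal - X @ coef

# Synthetic clean-air calibration data (hypothetical values, not measurements).
rng = np.random.default_rng(0)
n = 500
temp_c = rng.uniform(15.0, 40.0, n)
rh_pct = rng.uniform(20.0, 80.0, n)
true_coef = np.array([1.0, 0.02, -0.005])
baseline = (true_coef[0] + true_coef[1] * temp_c + true_coef[2] * rh_pct
            + rng.normal(0.0, 0.001, n))

coef = fit_drift_model(temp_c, rh_pct, baseline)

# Simulate a constant gas response riding on top of the drifting baseline.
gas_response = 0.3
raw = baseline + gas_response
compensated = compensate(raw, temp_c, rh_pct, coef)
```

In practice the drift may be nonlinear or sensor-specific, in which case the linear design matrix would be replaced by a richer model; the fit-then-subtract structure stays the same.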

Target outcome: a reproducible pipeline and open dataset splits plus a production-ready reference implementation that demonstrates reliability for lab → field transition.

Methods — Sparse SHAP

Sparse SHAP: Fast Approximate SHAP Calculation for Large Tabular Models

Status: Ongoing — method development & benchmarking


Sparse SHAP proposes a fast, approximate approach to computing SHAP-like attributions for very large tabular models by exploiting sparsity in feature interactions and using sampling + importance reweighting to reduce runtime while preserving faithfulness. The method targets production use where exact SHAP is too slow or memory-heavy.

Introduce sparsity-aware sampling schemes that select only a subset of feature coalitions most likely to contribute to the prediction.
Use importance reweighting and low-variance estimators to correct the bias introduced by sparse sampling.
Provide theoretical bounds on approximation error under realistic sparsity assumptions for tabular datasets.
Benchmark against TreeSHAP, KernelSHAP and other approximations on speed, memory, and fidelity (AUC of attribution ranking, correlation with exact SHAP).
Implement a PyTorch/NumPy reference implementation with CPU/GPU options, and integrate an interactive demo to visualize sparse attributions and compare with full SHAP results.
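For context on the sampling idea behind these bullets, here is a minimal sketch of a standard Monte Carlo permutation estimator for Shapley values. It is not the Sparse SHAP method itself, whose sparsity-aware sampling and reweighting are still in development; the function name `permutation_shapley`, the single-baseline value function, and the toy linear model are assumptions made for illustration.

```python
import numpy as np

def permutation_shapley(predict, x, baseline, n_permutations=200, seed=0):
    """Estimate Shapley values by averaging marginal contributions
    over random feature orderings.

    Features absent from a coalition take their value from `baseline`.
    """
    rng = np.random.default_rng(seed)
    d = x.shape[0]
    phi = np.zeros(d)
    for _ in range(n_permutations):
        order = rng.permutation(d)
        z = baseline.copy()          # start from the all-absent coalition
        prev = predict(z)
        for j in order:
            z[j] = x[j]              # add feature j to the coalition
            cur = predict(z)
            phi[j] += cur - prev     # marginal contribution of j
            prev = cur
    return phi / n_permutations

# Toy linear model (hypothetical): exact Shapley values are w * (x - baseline).
weights = np.array([2.0, -1.0, 0.0, 0.5])

def linear_model(v):
    return float(weights @ v)

x = np.array([1.0, 3.0, 5.0, -2.0])
baseline = np.zeros(4)
phi = permutation_shapley(linear_model, x, baseline)
```

Each permutation costs one model call per feature, so runtime grows with both the number of features and the number of permutations. That cost is what a sparsity-aware scheme would aim to cut by sampling only the coalitions likely to matter.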

Goal: enable interpretable explanations at production scale for large tabular models without prohibitive compute cost, and provide open-source code and evaluation scripts.

*Additional manuscripts in preparation on cross-domain interpretability, human-centered AI, and counterfactual reasoning frameworks.*

Research Projects

Predictive Model for Detection of Psychological Distress Using Machine Learning and Psychometric Analysis

Original Dataset Thesis · Demo

Hybrid ML pipeline (SVM + XGBoost) trained on a self-collected psychological dataset integrating DASS-21, SWLS, and BFI-10 measures to predict affective states with explainable reasoning.

Role: Lead researcher — designed survey instruments, recruited participants, curated dataset, developed hybrid models, and evaluated interpretability.

Data Collection: Personally collected and anonymized participant data via standardized psychological questionnaires, ensuring ethical consent and balanced representation.

Approach: Feature engineering on psychometric scales, hybrid SVM+XGBoost architecture, and multi-level XAI pipeline.
Key result: Strong predictive performance and interpretable insights (detailed metrics in thesis).
Impact: Deployed prototype (PsychoPredictor AI) for real-time screening and clinician review.
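As one possible reading of "feature engineering on psychometric scales", the sketch below turns raw DASS-21 item responses into the three subscale scores using the commonly published scoring convention (seven items per subscale, each rated 0 to 3, sums doubled). The thesis's actual feature set is not described here, so the function name `score_dass21` and the choice of features are assumptions, and the item key should be checked against the official DASS manual before reuse.

```python
# Item numbers (1-based) per subscale, following the commonly published
# DASS-21 scoring key. Verify against the official manual before reuse.
DASS21_ITEMS = {
    "depression": (3, 5, 10, 13, 16, 17, 21),
    "anxiety": (2, 4, 7, 9, 15, 19, 20),
    "stress": (1, 6, 8, 11, 12, 14, 18),
}

def score_dass21(responses):
    """Return doubled subscale sums for 21 item responses rated 0-3."""
    if len(responses) != 21:
        raise ValueError("DASS-21 requires exactly 21 responses")
    if any(r not in (0, 1, 2, 3) for r in responses):
        raise ValueError("each response must be an integer from 0 to 3")
    return {
        scale: 2 * sum(responses[i - 1] for i in items)
        for scale, items in DASS21_ITEMS.items()
    }

# Hypothetical respondent who answers 1 on every item.
example = [1] * 21
features = score_dass21(example)
```

Subscale scores like these could then be combined with SWLS and BFI-10 scores into the tabular input for a downstream classifier.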


PsychoPredictor AI — Web Platform Live

Full-stack prototype enabling questionnaire input, model inference, and SHAP-driven instance explanations for end users/clinicians.

Role: Full-stack developer & integrator

Tools: FastAPI, Jinja2, Render, joblib, SHAP

Features: Real-time model inference, SHAP instance views, CSV batch upload, and secure input handling.
Deployment: Hosted on Render with joblib-serialized models and a responsive UI for clinician workflows.
Next: adding interactive SHAP force-plot exports and clinician feedback loops.


*More projects, figures, and preprints will be added as they become available.*

Technical Skills

Programming & Development

Python, Java, C/C++, SQL, Git
API Development: FastAPI, Flask
Web Deployment: Render, GitHub Pages

Machine Learning & Data Science

scikit-learn, XGBoost, LightGBM, CatBoost
TensorFlow, PyTorch, Keras
Feature Engineering, Cross-Validation, Model Tuning
Smart Sensors, AI-Enabled Wireless Systems, Edge AI

Data Analysis & Visualization

Pandas, NumPy, SciPy, Statsmodels
Matplotlib, Seaborn, Plotly, Altair
Statistical Testing & Exploratory Analysis

Deep Learning & Representation Learning

Neural Networks: CNNs, RNNs, Transformers
Transfer Learning & Embedding Interpretability
Experiment Tracking: MLflow, Weights & Biases (W&B)

Research & Scientific Tools

Jupyter Notebooks, Google Colab, VS Code
MATLAB (basic), R (statistical modeling)
LaTeX, Reproducible Research Practices

Cloud, Deployment & Infrastructure

Docker, GitHub Actions (CI/CD)
Google Cloud, AWS (EC2, S3 basics)
GPU Computing (CUDA environments, Colab Pro)