Shubam Gupta • AI & Cloud Architect

Shubam Gupta

Member of Technical Staff @ AMD • AI/ML Specialist • Platform Architect

About Me

Technology leader with 9+ years transforming complex challenges into scalable AI-powered solutions that deliver measurable business impact.

6.25×
Capacity Improvement
System Utilization Gains
87%
Forecasting Accuracy

Currently at AMD, I'm spearheading the PRISM AI platform — architecting multi-agent systems that integrate RAG pipelines, vector databases, knowledge graphs, and conversational AI to revolutionize hardware validation workflows across global edge infrastructure.

My expertise spans distributed event-driven architectures, cloud-native platforms (AWS/Azure), Kubernetes orchestration, and intelligent automation — consistently delivering solutions that scale from proof-of-concept to production systems handling millions of events daily.

AI/ML Platform Engineering Distributed Systems Cloud Architecture Edge Computing Technical Leadership

Core Skills & Expertise

AI/ML Engineering LLMs, RAG, GenAI
Vector Databases Qdrant, Milvus
Knowledge Graphs Cosmos DB, Graph Analytics
Conversational AI Chatbots, NLP, Intent
Platform Architecture Distributed Systems
Event-Driven Architecture Kafka, EventHub, Async
Workflow Orchestration Job Scheduling, Routing
Edge Computing Global Infrastructure
Cloud Platforms AWS, Azure, Multi-Cloud
Kubernetes & Docker Container Orchestration
CI/CD Pipelines GitHub Actions, Azure DevOps
DevSecOps Security Scanning, OWASP
FastAPI / Flask / Django REST APIs, Microservices
Spring Boot Java, Enterprise Apps
Hardware Integration BMC, IoT, Embedded
Message Queues Kafka, RabbitMQ, Streaming
Data Analytics Real-time Dashboards
Databases SQL, NoSQL, Graph DBs
Data Engineering ETL, Spark, Databricks
Monitoring & Observability ELK, Grafana, Logging
Blockchain Ethereum, Smart Contracts
IoT Systems Device Integration, Protocols
AR/VR Development Unity, 3D Applications
Technical Leadership Team Management, Mentorship

Tech Stack

Python

Python

FastAPI

FastAPI

Kubernetes

Kubernetes

Docker

Docker

Kafka

Kafka

TensorFlow

TensorFlow

NodeJS

NodeJS

AWS

AWS

Azure

Azure

Redis

Redis

Java

Java

JavaScript

JavaScript

GitHub Actions

GitHub Actions

Azure DevOps

Azure DevOps

Google Cloud

Google Cloud

Azure AI

Azure AI

Elastic

Elastic

Databricks

Databricks

PostgreSQL

PostgreSQL

MySQL

MySQL

MongoDB

MongoDB

Milvus

Milvus

Qdrant

Qdrant

RabbitMQ

RabbitMQ

Grafana

Grafana

Spring Boot

Spring Boot

LangChain

Hugging Face

Hugging Face

IoT/Arduino

IoT/Arduino

PyTorch

PyTorch

Experience

Member of Technical Staff @ AMD

Sept 2023 – Present

Platform Architecture & Scalability

  • Co-architected PRISM Automation Platform — distributed event-driven system processing 1M+ log entries for silicon, BIOS, and memory validation with autonomous BMC debug data collection
  • Scaled validation capacity 6.25× (32 → 200+ concurrent runs) through global edge infrastructure across 5 sites, reducing cycle time by 35% and latency by 40%
  • Engineered Station Descriptor service — core configuration management powering intelligent hardware search and workload orchestration across 200+ stations

AI/ML & Intelligent Systems

  • Leading PRISM AI initiative — multi-agent system with LLM-powered conversational interface, RAG pipelines, vector database triage (50% faster debug), and knowledge graph-based log intelligence
  • Built Azure Cosmos DB knowledge graph with custom prismai library, transforming log analysis from hours to minutes
  • Co-developed Intelligent Routing Engine using Kubernetes orchestration, cutting queue time by 40% and doubling resource efficiency

Cloud Engineering, DevOps & Data Analytics

  • Architected end-to-end CI/CD pipeline with automated security scanning (SonarQube, Trivy, OWASP), reducing deployment failures by 70% and enabling 50+ safe weekly releases with quality gates and rollback capabilities
  • Led Azure DevOps → GitHub Actions migration, improving build parallelism by 3× and cutting deployment time by 67% (45→15 minutes) through optimized workflow orchestration
  • Built real-time analytics dashboard tracking config utilization, availability, and demand — improving resource efficiency by 45% and enabling predictive capacity planning

Technical Leadership & Team Development

  • Co-led Core Validation team onboarding to PRISM platform, accelerating adoption by 40% through comprehensive training programs and hands-on workshops
  • Mentoring engineers on cloud migration strategies, Kubernetes deployments (n8n, NocoDB alerting systems), and distributed systems architecture patterns

Assistant General Manager @ Infinity Learn

July 2021 – Sept 2023

  • Led product management and technical transition of WizKlub Futurz program following acquisition, ensuring seamless integration of learning platform and curriculum delivery systems.
  • Architected and deployed VISTA — Generative AI application leveraging LLM technology to provide interactive, conversational responses to student queries on NCERT Science topics with contextual accuracy.
  • Directed ML engineering team in research and development of custom domain-specific LLMs for solving advanced Physics, Chemistry, and Mathematics problems tailored for IIT JEE preparation.
  • Pioneered AI-driven auto-evaluation framework for Blockly-based coding assessments, using Generative AI to analyze student code logic, assign scores, and generate detailed justifications for automated feedback at scale.

Chief Architect @ WizKlub Learning Pvt. Ltd.

May 2021 – June 2023

  • Led 20+ member cross-functional engineering team, orchestrating agile release cycles and delivering weekly/monthly product milestones for K-12 adaptive learning platform.
  • Architected enterprise-grade AWS cloud infrastructure integrating Lambda, IoT Core, ELK stack, and multiple AWS services while optimizing for cost efficiency, security, and performance.
  • Championed DevSecOps transformation by embedding automated security and quality gates (JMeter, SonarQube) into CI/CD pipelines, reducing vulnerabilities and accelerating time-to-market.
  • Designed and deployed highly customizable LMS platform serving WizKlub's core curriculum with intuitive content management and adaptive learning workflows.
  • Pioneered no-code Alexa Skills builder leveraging AWS Lambda, Alexa Skills Kit, and Google Blockly to democratize voice app development through drag-and-drop interfaces.
  • Engineered IoT middleware framework bridging WizKlub hardware devices with AWS IoT Core using Blockly-based visual programming for seamless device onboarding and control.
  • Implemented comprehensive observability stack with AWS CodeCommit for version control and ELK for real-time monitoring, logging, and performance analytics.

Multiple Roles @ Cognizant

Nov 2016 – May 2021

Data Scientist

  • Developed automated data collection framework from enterprise systems & databases
  • Built ML-based sales forecasting models achieving 87% accuracy across regions
  • Created CNN-based text extraction from images with automated ML wrapper
  • Implemented employee attrition prediction models for workforce planning

Chatbot/Virtual Assistant Developer

  • Built scalable chatbot framework using Flask, Sklearn, NLTK, SpaCy
  • Designed preprocessing pipeline: spell correction, semantic analysis, lemmatization
  • Created omni-channel chatbot (WhatsApp, Slack, Facebook, Google Assistant, SMS)
  • Developed Alexa Skill with Salesforce & Ethereum blockchain integration

Technology Lead

  • Led 4-member team building chatbot development platform
  • Architected containerized deployment framework (Docker, Redis, RabbitMQ)
  • Implemented monitoring stack (Kibana/Logstash/Grafana) for logging & alerts
  • Managed agile processes, sprint planning, and cross-team collaboration

R&D Engineer

  • Explored blockchain networks and built enterprise solutions on top
  • Developed AR application using Unity, Vuforia, 3DS Max for information augmentation
  • Created IoT-based product authentication for supply chain (Particle Photon, Ethereum)
  • Integrated ML analytics for real-time product status monitoring & alerting

Patents & Awards

US 2022/0005263 A1

Automated Data Visualization & Modification (2022)

US 2022/0141022 A1

Securing & Authenticating Serialized Data (2022)

Awards & Recognition

Spotlight Award (6×)

Advanced Micro Devices

Exceptional contributions to PRISM platform architecture, AI initiatives, and technical leadership

Hero Award

Advanced Micro Devices

Building Intelligent Routing solution achieving 2× system utilization improvement

Growth Champion

Infinity Learn

Recognized for exceptional leadership potential and future-readiness

Knowledge Stalwart Award

Cognizant Academy

Excellence in knowledge sharing, technical mentorship, and community building

Rising Star Award

Cognizant

Outstanding performance and rapid business impact delivery

Orbit Shifter Award

Cognizant

Top 2% performance in company-wide coding assessment

Let's Connect

Open to collaborations, speaking engagements, and open-source projects.