Quick Review
Session Snapshot
This unified shell tracks reviewed state, pinned practice questions, and a priority queue across technical, project, and JD prep.
Start Here Today
Open your self-intro, rehearse one flagship project story, then clear 5 priority technical questions.
Pinned Questions
Use pins as your final-hour shortlist.
Priority Queue
High-value practice questions that are still unreviewed.
Recently Reviewed
Jump back into the last answers you touched across the full prep bank to keep your resume context fresh.
Venkata Naveen Busiraju
Gen AI Application Developer · Atos (Syntel) · Chennai
5.5+ YoE · 65% Review Time ↓ · 96% OCR Accuracy · 55% AR Resolution ↓ · 2 Cloud Certs
⏱ 60-Second Introduction
"I'm Naveen, a Gen AI Developer with 5.5 years of experience, currently at Atos. My core focus is building enterprise-grade AI solutions β€” specifically RAG systems, multi-agent workflows, and intelligent document pipelines. My two flagship projects: a Long-context RAG system using Azure OpenAI GPT-4 and LangGraph that reduced compliance document review time by 65%, and an Account Receivable Email Agent with an OCR-to-LLM pipeline that achieved 96% field extraction accuracy and cut invoice resolution time by 55%. I work primarily in Python β€” LangChain, LangGraph, HuggingFace, PyTorch β€” deployed on GCP and AWS using Docker, Kubernetes, and GitHub Actions. I hold an MSc in Computer Science from the University of East London and I'm AWS Certified DevOps Engineer Professional. I'm excited about this role because it maps directly to what I do β€” conversational AI, RAG, OCR pipelines, and cloud deployment at enterprise scale."
End with one sentence connecting your background to THIS specific role: replace the last sentence with one that names the JD's core requirement.
⏱ 90-Second Introduction
"I'm Naveen, a Gen AI Application Developer with 5.5 years of experience, currently at Atos β€” an IT services company. For the past two years, my work has centred on building production-grade AI applications for enterprise clients. On the AI development side, I've built two major systems. First, a Long-context RAG pipeline β€” using Azure OpenAI GPT-4, LangChain, and LangGraph β€” that analyses learning course materials against compliance checklists and automates reporting to SharePoint. That system cut document review time by 65% and handles multi-agent orchestration with dedicated retrieval, validation, and synthesis agents. Second, an Account Receivable Email Agent that classifies inbound invoices, extracts structured data using OCR and Llama-3, validates it against an ERP schema, and achieves 96% field-level accuracy β€” reducing AR resolution time by 55%. My technical stack spans Python, LangChain, LangGraph, HuggingFace Transformers, PyTorch, pdfplumber, pytesseract, FastAPI, Docker, Kubernetes, and GCP and AWS for deployment. I also have experience with LoRA and QLoRA fine-tuning using the PEFT library. Beyond the AI work, I've built out CI/CD pipelines, led microservices migrations from legacy Java to Angular and Node.js, and contributed to GCP infrastructure for a Decarbonization project. I hold a Master's from the University of East London and I'm AWS Certified DevOps Engineer Professional and Google Cloud Associate Cloud Engineer. This role aligns closely with what I do every day β€” and I'm particularly excited about the opportunity to work at scale on conversational AI and enterprise knowledge systems."
Use this for HR/recruiter screens or first-round interviews. Adjust the final sentence to name the company.
🔬 Technical Interviewer Version
"Naveen β€” GenAI developer, 5.5 years. Primary focus: enterprise LLM systems. Two production systems worth highlighting. One: a Long-context RAG system built on LangGraph with a multi-agent architecture β€” retrieval agent, checklist validation agent, synthesis agent, and orchestrator. Used Azure OpenAI GPT-4 with hierarchical parent-child chunking, cross-encoder reranking, and a faithfulness validation pass before output. Deployed on GCP with Docker and Kubernetes, CI/CD via GitHub Actions including a prompt regression test gate. 65% reduction in review time. Two: an invoice intelligence pipeline β€” OCR pre-processing with OpenCV, pdfplumber for native PDFs, pytesseract with LLM correction pass for scanned documents, structured extraction via Pydantic + JSON mode, ERP-ready output. 96% field-level accuracy. Async FastAPI endpoint, monitored via LangSmith and Cloud Monitoring. I've also done LoRA/QLoRA fine-tuning with HuggingFace PEFT for domain-specific document Q&A, and I manage ML deployment lifecycle β€” model versioning, canary releases, semantic caching, rate-limit handling with multi-region fallback. Stack: Python, LangChain, LangGraph, MCP, PyTorch, HuggingFace, Qdrant/Vertex AI Vector Search, GCP, AWS, Docker/K8s."
Lead with architecture decisions and production specifics. Technical interviewers want to see depth, not bio.
📋 Key Talking Points Checklist
Always mention:
✓ 5.5+ years experience
✓ LangGraph multi-agent (differentiator)
✓ 65% + 55% impact numbers
✓ 96% OCR accuracy
✓ GCP + AWS deployment
✓ AWS DevOps Professional cert
Adapt by role:
GenAI focus → lead with RAG + agents
Platform focus → lead with GCP + K8s
Product focus → lead with business impact
Research focus → lead with LoRA + RLHF
Chatbot focus → lead with agent architecture
Long-Context RAG
AR Email Agent
CI/CD & GCP Deploy
Decarbonization
LLM Fine-Tuning
Project Overview
Enterprise-grade Long-context RAG (LRA) system for automated compliance checking of learning course materials against a Data Quality Checklist (DQC), with automated reporting to SharePoint.
65% Review Time ↓ · 4 Agent Nodes · 5+ File Formats
Business Problem & Why It Was Needed
Problem: Compliance teams manually reviewed every course material (PDFs, DOCX, PPTX, XLSX files) against a checklist of 50+ quality criteria. Each review took 2–3 hours, was inconsistent between reviewers, and created a bottleneck as the learning catalogue scaled.

Why AI: The process was rule-based enough to automate (structured checklist) but required language understanding sophisticated enough that simple pattern matching failed; courses used varied phrasing to express the same quality attribute. An LLM-powered system could understand semantic equivalence across phrasings.
Architecture / Technical Design
Ingestion flow: Document Upload (PDF/DOCX/PPTX/XLSX) → Extraction Layer (pdfplumber + python-docx) → Chunking (Parent-Child Hierarchical) → Embedding (Azure OpenAI ada-002) → Vector Store (Qdrant)

Query flow: Checklist Query → Retrieval Agent (ANN + Reranker) → Validation Agent (GPT-4 + CoT) → Synthesis Agent (Report Generation) → SharePoint Output
LangGraph orchestration: four nodes with conditional edges. Retrieval failure routes to query reformulation and retry; validation failure routes to a human review queue. State checkpointing at every node boundary provides fault tolerance.
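A minimal sketch of that routing pattern using LangGraph's StateGraph API; the node functions are stubs and the PipelineState fields are illustrative, not the production schema:

```python
from typing import Literal, TypedDict

from langgraph.graph import END, StateGraph


class PipelineState(TypedDict):
    query: str
    context: list[str]
    status: Literal["success", "partial", "failed"]
    error_reason: str


# Stub agents: each node returns a structured status payload, as described above.
def retrieval_agent(state: PipelineState) -> dict:
    return {"context": ["..."], "status": "success", "error_reason": ""}

def reformulate_query(state: PipelineState) -> dict:
    return {"query": state["query"] + " (reformulated)"}

def validation_agent(state: PipelineState) -> dict:
    return {"status": "success", "error_reason": ""}

def synthesis_agent(state: PipelineState) -> dict:
    return {}

def human_review(state: PipelineState) -> dict:
    return {}


graph = StateGraph(PipelineState)
for name, fn in [("retrieval", retrieval_agent), ("reformulate", reformulate_query),
                 ("validation", validation_agent), ("synthesis", synthesis_agent),
                 ("human_review", human_review)]:
    graph.add_node(name, fn)

graph.set_entry_point("retrieval")
# Retrieval failure -> query reformulation -> retry.
graph.add_conditional_edges("retrieval", lambda s: s["status"],
                            {"success": "validation", "partial": "reformulate",
                             "failed": "reformulate"})
graph.add_edge("reformulate", "retrieval")
# Validation failure -> human review queue instead of synthesis.
graph.add_conditional_edges("validation", lambda s: s["status"],
                            {"success": "synthesis", "partial": "human_review",
                             "failed": "human_review"})
graph.add_edge("synthesis", END)
graph.add_edge("human_review", END)

app = graph.compile()  # a checkpointer can be passed here for per-node persistence
```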
My Exact Contributions
• Designed the full multi-agent architecture using LangGraph with MCP tool integration
• Implemented a hierarchical parent-child chunking strategy to handle 100+ page documents
• Built the cross-encoder reranking step (post-retrieval) that improved faithfulness by ~12%
• Wrote structured CoT prompts for the validation agent with few-shot examples
• Implemented Pydantic schema enforcement for all agent outputs
• Built the SharePoint write integration using the Microsoft Graph API
• Set up LangSmith tracing and a GCP Cloud Monitoring dashboard
• Wrote the GitHub Actions CI/CD pipeline, including the prompt regression test gate
Technologies Used
Python 3.11 · LangChain · LangGraph · MCP · Azure OpenAI GPT-4 · Qdrant · FastAPI · pdfplumber · python-docx · openpyxl · Cohere Rerank · Pydantic · LangSmith · Docker · Kubernetes (GKE) · GitHub Actions · GCP Cloud Monitoring · SharePoint Graph API
Key Challenges & Solutions
Challenge 1 – Long context / hallucination: 100+ page documents exceeded context windows; naive chunking caused hallucination on checklist items.
Solution: Hierarchical parent-child chunking with 15% overlap. Parent chunks (1024 tokens) provide broad context; child chunks (256 tokens) are indexed for retrieval. Added a faithfulness validation agent: a second LLM pass that checks every claim against the retrieved context.
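A plain-Python sketch of the parent-child scheme under those numbers (1024-token parents, 256-token children, 15% overlap); tokenisation is approximated by whitespace splitting here, whereas the real system would use a model tokenizer:

```python
def window(tokens: list[str], size: int, overlap_frac: float = 0.15) -> list[list[str]]:
    """Fixed-size windows with fractional overlap between consecutive windows."""
    step = max(1, int(size * (1 - overlap_frac)))
    return [tokens[i:i + size] for i in range(0, len(tokens), step)]


def parent_child_chunks(text: str) -> list[dict]:
    """Children (256 tokens) are what gets embedded and retrieved; each child
    carries its parent's id so generation can read the full 1024-token parent."""
    tokens = text.split()  # stand-in for a real tokenizer
    chunks = []
    for parent_id, parent in enumerate(window(tokens, 1024)):
        for child in window(parent, 256):
            chunks.append({
                "parent_id": parent_id,
                "parent_text": " ".join(parent),  # broad context for generation
                "child_text": " ".join(child),    # indexed for retrieval
            })
    return chunks
```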

Challenge 2 – Agent pipeline fault tolerance: Silent failures mid-pipeline caused data loss without the orchestrator detecting them.
Solution: Every agent node emits a structured status payload (success/partial/failed + error_reason). LangGraph conditional routing handles each status explicitly: retry with a reformulated query, or route to the human-review queue.

Challenge 3 – Multi-format document handling: PPTX and XLSX required different extraction strategies than PDFs.
Solution: Built a document router that selects an extraction strategy by MIME type. Each extractor normalises its output to a standard DocumentChunk schema before the chunking layer, as sketched below.
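The router pattern, sketched with stub extractors; the DocumentChunk fields and extractor internals are assumptions, but the shape matches the description: one extractor per MIME type, all normalising to the same schema.

```python
from dataclasses import dataclass


@dataclass
class DocumentChunk:
    text: str
    source: str
    page: int | None = None


# Stub extractors; the real ones would wrap pdfplumber, python-docx,
# python-pptx, and openpyxl respectively.
def extract_pdf(path: str) -> list[DocumentChunk]: ...
def extract_docx(path: str) -> list[DocumentChunk]: ...
def extract_pptx(path: str) -> list[DocumentChunk]: ...
def extract_xlsx(path: str) -> list[DocumentChunk]: ...


EXTRACTORS = {
    "application/pdf": extract_pdf,
    "application/vnd.openxmlformats-officedocument.wordprocessingml.document": extract_docx,
    "application/vnd.openxmlformats-officedocument.presentationml.presentation": extract_pptx,
    "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet": extract_xlsx,
}


def route_document(path: str, mime_type: str) -> list[DocumentChunk]:
    """Select the extraction strategy by MIME type; unknown types fail loudly."""
    try:
        return EXTRACTORS[mime_type](path)
    except KeyError:
        raise ValueError(f"No extractor registered for {mime_type}") from None
```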
Measurable Outcomes
Review time reduction: 65%
Faithfulness score: 0.92
Pipeline success rate: 97%
Reviewer consistency: ↑95%
Lessons Learned / If I Redid It
• Would add GraphRAG for multi-hop compliance questions that span document sections
• Would implement Matryoshka embeddings to reduce vector store memory threefold without recall loss
• Would add A/B testing infrastructure at the prompt level from day one; prompt changes were hard to roll back without it
• Would use OpenTelemetry from the start for distributed tracing instead of adding it later
Interview Drill Bank
Project Overview
Intelligent email agent that classifies incoming AR emails, extracts and validates invoice data from attachments using an OCR + LLM pipeline, and produces ERP-ready JSON output, reducing manual data entry for the accounts receivable team.
55% AR Resolution ↓ · 96% Field Accuracy · 4 Pipeline Stages
Business Problem & Why It Was Needed
Problem: AR teams received hundreds of vendor emails daily. Each email required: reading the subject/body to understand the request type, downloading the invoice attachment, manually keying 8–12 fields into the ERP system, validating against purchase orders, and filing the email. This 15–20 minute process per invoice was the primary bottleneck in the payment cycle.

Business case: 55% reduction in resolution time directly translates to faster payment processing, fewer late payment penalties, and reduced headcount requirement for manual processing.
Pipeline Architecture
Email Ingestion (IMAP / Graph API) → Classification (Intent + Priority) → OCR Pre-processing (OpenCV + Router) → LLM Correction (Llama-3) → Schema Extraction (Pydantic + JSON Mode) → ERP Output
My Exact Contributions & How 96% Accuracy Was Achieved
Accuracy improved incrementally across pipeline stages:

Step 1 – Baseline (72%): Raw pytesseract on varied invoice formats
Step 2 → 81%: Added OpenCV pre-processing (deskew, binarize, contrast enhance)
Step 3 → 87%: PDF type routing; pdfplumber for native PDFs eliminates OCR errors entirely for that class
Step 4 → 93%: Llama-3 LLM correction pass cleans OCR artifacts in context
Step 5 → 96%: Pydantic schema validation with retry; failed validation triggers a targeted re-extraction prompt focused only on the failing field (sketched below)
Remaining 4%: Flagged to a human review queue, not silently passed to ERP
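A hedged sketch of steps 4 and 5 above: the model's JSON output is validated against a Pydantic schema, and a validation failure triggers a re-extraction prompt scoped to only the failing fields. The Invoice fields and the llm_json helper are illustrative assumptions.

```python
from pydantic import BaseModel, ValidationError


class Invoice(BaseModel):  # illustrative subset of the ERP schema
    invoice_number: str
    vendor_name: str
    total_amount: float
    currency: str


def extract_invoice(text: str, llm_json, max_retries: int = 2) -> Invoice | None:
    """llm_json(prompt) -> JSON string from the model in JSON mode (assumed helper).
    Returns None when retries are exhausted, i.e. route to human review."""
    focus: set[str] = set()  # fields to re-extract on retry
    for _ in range(1 + max_retries):
        prompt = f"Extract invoice fields from:\n{text}"
        if focus:
            prompt += f"\nOnly re-extract these fields carefully: {sorted(focus)}"
        try:
            return Invoice.model_validate_json(llm_json(prompt))
        except ValidationError as exc:
            focus = {str(err["loc"][0]) for err in exc.errors() if err["loc"]}
    return None  # remaining failures go to the human review queue
```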
Technologies Used
Python · Llama-3 · pdfplumber · pytesseract · OpenCV · Pydantic · FastAPI (async) · LangGraph · Microsoft Graph API · Redis · Docker · GCP
Key Challenges & Solutions
Challenge – OCR accuracy on poor-quality scans: Photographed and faxed invoices had skew, smudges, and low contrast. Raw pytesseract scored 72%.
Solution: Three-stage OpenCV pre-processing pipeline: (1) deskew using a Hough transform to detect the rotation angle, (2) adaptive thresholding for binarization, (3) CLAHE contrast enhancement. Applied before every pytesseract call.
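A sketch of the three stages, assuming OpenCV; parameter values are illustrative. Note that CLAHE is applied before binarisation here, since thresholding already yields a binary image:

```python
import cv2
import numpy as np


def preprocess_for_ocr(image_bgr: np.ndarray) -> np.ndarray:
    gray = cv2.cvtColor(image_bgr, cv2.COLOR_BGR2GRAY)

    # (1) Deskew: estimate the dominant text-line angle with a probabilistic
    # Hough transform, then rotate to correct it.
    edges = cv2.Canny(gray, 50, 150)
    lines = cv2.HoughLinesP(edges, 1, np.pi / 180, threshold=100,
                            minLineLength=gray.shape[1] // 4, maxLineGap=20)
    if lines is not None:
        angles = [np.degrees(np.arctan2(y2 - y1, x2 - x1))
                  for x1, y1, x2, y2 in lines[:, 0]]
        angles = [a for a in angles if abs(a) < 45]  # ignore vertical rules
        if angles:
            h, w = gray.shape
            m = cv2.getRotationMatrix2D((w / 2, h / 2), float(np.median(angles)), 1.0)
            gray = cv2.warpAffine(gray, m, (w, h), borderMode=cv2.BORDER_REPLICATE)

    # (3) CLAHE contrast enhancement on the greyscale image.
    gray = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8)).apply(gray)

    # (2) Adaptive thresholding for binarisation, robust to uneven lighting.
    return cv2.adaptiveThreshold(gray, 255, cv2.ADAPTIVE_THRESH_GAUSSIAN_C,
                                 cv2.THRESH_BINARY, 31, 15)
```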

Challenge – Financial data integrity: A hallucinated invoice amount of $10,000 instead of $1,000 would cause real business harm.
Solution: Conservative confidence thresholds: any field with extraction confidence below 0.85 (or a schema validation failure) routes to human review rather than auto-submitting to the ERP. Designed for precision over recall.
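The routing rule itself is small; a sketch, assuming per-field confidences are available from the extraction step:

```python
CONFIDENCE_FLOOR = 0.85  # from the threshold described above


def route(field_confidences: dict[str, float], schema_valid: bool) -> str:
    """Precision over recall: anything uncertain goes to a human, never to the ERP."""
    if not schema_valid or any(c < CONFIDENCE_FLOOR for c in field_confidences.values()):
        return "human_review"
    return "erp_submit"
```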
Interview Drill Bank
Project Overview
Led CI/CD pipeline design and ML model deployment infrastructure on GCP and AWS. Containerised all AI services, implemented automated testing including prompt regression gates, and managed Kubernetes deployments with zero-downtime rolling updates.
Pipeline Architecture
GitHub Actions Pipeline Stages:
1. lint + unit tests (pytest, flake8)
2. Prompt regression tests – run the eval harness against the golden dataset; fail if faithfulness drops > 5% (sketched after this list)
3. Docker build + image scan (Trivy for CVEs)
4. Push to GCP Artifact Registry with commit SHA tag (never :latest)
5. Deploy to staging via kubectl rolling update
6. Integration tests on staging
7. Manual approval gate for production
8. Deploy to production with HPA config
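Stage 2, sketched as a pytest gate; the file names, run_eval hook, and baseline format are assumptions:

```python
import json
from pathlib import Path

DROP_LIMIT = 0.05  # fail the build if faithfulness drops more than 5%


def run_eval(examples: list[dict]) -> dict:
    """Placeholder: call the real eval harness (e.g. RAGAS over the golden set)."""
    return {"faithfulness": 0.92}


def test_prompt_regression():
    golden = [json.loads(line)
              for line in Path("golden_dataset.jsonl").read_text().splitlines()]
    baseline = json.loads(Path("eval_baseline.json").read_text())["faithfulness"]
    current = run_eval(golden)["faithfulness"]
    assert baseline - current <= DROP_LIMIT, (
        f"faithfulness dropped {baseline - current:.3f} (limit {DROP_LIMIT})")
```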
Technologies Used
GitHub Actions · Docker · Kubernetes (GKE) · Helm · Trivy · GCP Artifact Registry · Cloud Monitoring · LangSmith · Pytest · AWS ECS · Terraform
Interview Drill Bank
Project Overview
Developed and integrated Java Spring Boot solutions for a client's Decarbonization tracking platform deployed on GCP. Contributed to data ingestion APIs, metrics calculation services, and real-time reporting dashboards.
Contributions
• Built REST APIs in Spring Boot for carbon emission data ingestion from IoT sources
• Implemented GCP Cloud Pub/Sub consumers for real-time streaming data processing
• Migrated a legacy reporting module to Angular for a modern, responsive UI
• Configured GCP Cloud Run for containerised microservice deployment
• Participated in CI/CD setup using GitHub Actions and Docker on GCP
Technologies Used
Java Spring Boot · Angular · Node.js · GCP Cloud Run · Cloud Pub/Sub · BigQuery · Docker · GitHub Actions
Interview Drill Bank
Project Overview
Experimented with LoRA/QLoRA fine-tuning using HuggingFace Transformers and PEFT to adapt base LLMs for domain-specific document Q&A tasks where the base model consistently misclassified domain-specific checklist items.
Approach & Findings
Problem: GPT-4 base model with prompting misclassified ~15% of domain-specific compliance items due to unusual domain vocabulary tokenising into meaningless subword fragments.

Approach: Used QLoRA (4-bit quantisation via bitsandbytes) with LoRA adapters (r=16, alpha=32, target_modules: q_proj, v_proj) on a 7B Llama model. Synthetic training data was generated by GPT-4: 3,000 domain Q&A triples from compliance documents. Trained on a single A100 using the HuggingFace Trainer with early stopping on validation loss.

Key hyperparameters: LR=2e-4, cosine schedule, warmup ratio=0.03, batch size=8 with gradient accumulation × 4, 3 epochs.
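Those hyperparameters, expressed as a hedged PEFT/Transformers sketch; the model id and dataset wiring are placeholders, and argument names may vary slightly across Transformers versions:

```python
import torch
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
from transformers import (AutoModelForCausalLM, BitsAndBytesConfig,
                          TrainingArguments)

BASE = "meta-llama/Llama-2-7b-hf"  # placeholder id for the 7B base model

bnb = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_quant_type="nf4",
                         bnb_4bit_compute_dtype=torch.bfloat16)
model = AutoModelForCausalLM.from_pretrained(BASE, quantization_config=bnb)
model = prepare_model_for_kbit_training(model)
model = get_peft_model(model, LoraConfig(
    r=16, lora_alpha=32, target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05, task_type="CAUSAL_LM"))

args = TrainingArguments(
    output_dir="qlora-out",
    learning_rate=2e-4,
    lr_scheduler_type="cosine",
    warmup_ratio=0.03,
    per_device_train_batch_size=8,
    gradient_accumulation_steps=4,   # effective batch = 8 x 4
    num_train_epochs=3,
    evaluation_strategy="epoch",
    save_strategy="epoch",
    load_best_model_at_end=True,     # needed for early stopping on val loss
)
# trainer = transformers.Trainer(model=model, args=args, train_dataset=...,
#     eval_dataset=..., callbacks=[transformers.EarlyStoppingCallback(2)])
```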

Outcome: Domain-specific classification accuracy improved by ~8% over prompted GPT-4 on the held-out eval set. General capability regression was under 1% on the MMLU benchmark.
Technologies Used
Python · HuggingFace Transformers · PEFT · bitsandbytes (QLoRA) · Llama-3 7B · PyTorch · Datasets (HF) · MLflow · Wandb
Interview Drill Bank
📅 Day-to-Day Activities – Sample Answer
"My day-to-day revolves around three core areas β€” AI solution development, pipeline reliability, and cross-functional collaboration. On the development side, I spend most of my time iterating on RAG pipelines and agent workflows. That means working on chunking strategies, embedding model selection, vector store configuration, reranker tuning, and prompting β€” all measured against a golden evaluation dataset. I work heavily in Python with LangChain and LangGraph, and every prompt change is version-controlled and regression-tested before it ships. For pipeline reliability, I monitor our OCR and extraction pipelines β€” reviewing extraction accuracy dashboards, debugging edge cases on unusual invoice formats, and improving pre-processing steps. If human review rates spike, that's my first indicator something regressed β€” I investigate before users notice. On the infrastructure side, I review GitHub Actions workflows, validate Docker image builds, and keep an eye on our GCP deployments β€” latency, pod health, token cost per request. If a model update is going out, I run the eval harness first. Stakeholder-facing work takes maybe 20% of my time β€” translating business requirements into AI system designs, demoing outputs to compliance or finance teams, and incorporating their feedback. Those sessions often surface edge cases that our automated eval doesn't catch. No two days are the same, but the core loop is always: build β†’ evaluate β†’ debug β†’ deploy."
🔄 Development Lifecycle
Requirements → POC (Streamlit) → Architecture review → Build → Prompt regression tests → Staging deploy → Stakeholder demo → Production → Monitor
📊 Monitoring Responsibilities
LangSmith traces · Cloud Monitoring dashboards · Human review rate · Token cost/request · P95 latency · Extraction accuracy trends
🤝 Stakeholder Interactions
Compliance teams · AR finance teams · Product managers · Infrastructure team · Client-side business analysts
🛠️ Maintenance Ownership
Vector store index freshness · Prompt version management · Embedding model upgrades · Dependency security patches · On-call for P1 incidents
Tell me about a time you solved a technically difficult problem.
SITUATION
In my LRA compliance system, the RAG pipeline was returning high faithfulness scores on our eval set but compliance teams were flagging incorrect outputs on real documents. Something was breaking in production that our tests didn't catch.
TASK
I needed to identify the root cause, fix it without breaking existing behaviour, and prevent recurrence, while the compliance team was actively using the system.
ACTION
I enabled full LangSmith trace logging in production (with sampling temporarily off) and reviewed traces for the flagged documents. I found the issue: our chunking strategy was splitting compliance clauses across chunk boundaries, so the start of a clause landed at the end of one chunk and its conclusion at the start of the next. The retriever found chunks near the relevant clause but not the clause itself. I increased chunk overlap from 5% to 15% and added a post-retrieval check that merged adjacent chunks when retrieved chunks shared a document ID within 2 positions.
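The merge step, sketched; chunk records are assumed to carry doc_id, position, and text fields:

```python
def merge_adjacent(chunks: list[dict], max_gap: int = 2) -> list[dict]:
    """Merge retrieved chunks from the same document whose positions fall within
    max_gap of each other, so clauses split across a boundary are reassembled."""
    ordered = sorted(chunks, key=lambda c: (c["doc_id"], c["position"]))
    merged: list[dict] = []
    for chunk in ordered:
        last = merged[-1] if merged else None
        if (last and last["doc_id"] == chunk["doc_id"]
                and chunk["position"] - last["position"] <= max_gap):
            last["text"] += " " + chunk["text"]
            last["position"] = chunk["position"]
        else:
            merged.append(dict(chunk))
    return merged
```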
RESULT
Faithfulness on the previously-failing document class improved by 18%. I added those document types to the golden eval set so the regression test would catch this class of failure going forward. I also wrote a post-mortem documenting the root cause and the fix.
Tell me about a time you disagreed with a technical decision and how you handled it.
SITUATION
During the AR Email Agent project, a senior team member proposed fully synchronous processing for invoice extraction: keep it simple, process emails in sequence. I believed this would create a bottleneck at peak invoice submission times (end of month).
TASK
I needed to make the case for an async queue architecture without undermining the senior engineer's authority or creating team friction.
ACTION
I ran a quick load test simulation: I modelled peak email volume (200 invoices/hour at month-end) against average processing time (8–12 seconds per invoice including LLM calls). The synchronous model would create a 30-minute backlog. I prepared a simple comparison doc, sync vs async architecture, showing the latency profile, retry logic complexity, and infrastructure cost. I presented it in the next design review as a 'trade-off analysis' rather than a disagreement, and proposed starting with sync for the MVP but building the queue abstraction layer from day one so the async swap would be low-risk.
RESULT
The team adopted the async approach from the start. When we hit peak month-end load in production, the queue handled the burst gracefully with no user-visible backlog. The senior engineer acknowledged the trade-off analysis approach was useful and asked me to do similar analyses for future architectural decisions.
Tell me about a failure and what you learned from it.
SITUATION
Early in the AR Email Agent project, I upgraded the embedding model from text-embedding-ada-002 to text-embedding-3-small; the newer model had significantly better MTEB benchmark scores, and I assumed it would be a straightforward improvement.
TASK
Deploy the embedding model upgrade in production.
ACTION (the failure)
I updated the query embedding model in the config and deployed, without re-indexing the vector store. The existing vectors were encoded by the old model, and the new model produces vectors in a different geometric space. Retrieval quality dropped to near-random within minutes of the deploy.
RESULT & LESSON
I rolled back immediately; total downtime was under 10 minutes. The lesson: embedding models are not interchangeable. Any embedding model change requires a full re-index before switching query traffic. I added a deploy-time check to the CI/CD pipeline that compares the query embedding model in the config against the metadata of the vector store's current index; if they don't match, the deploy is blocked. I also documented this as a known failure mode in our runbook.
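The guard itself is a few lines; a sketch, assuming the index's embedding model name is stored alongside the vector store (e.g. in collection metadata):

```python
import sys


def assert_embedding_compat(config_model: str, index_model: str) -> None:
    """Run at deploy time: block the rollout when the query-side embedding model
    does not match the model that produced the index's vectors."""
    if config_model != index_model:
        sys.exit(f"DEPLOY BLOCKED: query model '{config_model}' != index model "
                 f"'{index_model}'. Re-index before switching query traffic.")
```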
How do you prioritise when multiple high-priority tasks compete?
SITUATION
During a two-week sprint, I had three competing priorities: a P1 production incident (human review rate spiked to 40%), a feature deadline for the compliance team (new checklist category integration), and a security patch for a vulnerable dependency.
ACTION
I applied a simple triage framework: 1) Unblock production first: the spiked human review rate indicated a quality regression affecting live users, and I spent half a day identifying and fixing the root cause (a prompt version accidentally reverted during a deploy). 2) Security patch second: it was a 30-minute fix and the risk of deferral was high. 3) Feature work third: I told the compliance team early (not on the day of the deadline) that I'd need 2 extra days, and provided an updated timeline with a clear reason and no vague language.
RESULT
Production was restored within 6 hours. Security patch shipped the same day. Feature was delivered 2 days late but with no surprise to the stakeholder. The compliance team appreciated the early communication β€” they said it was better than other teams who often went silent when behind.
Formula: unblock production → security/compliance → committed deadlines → new features. Communicate delays early, not on the day.
Describe a situation where you had to influence without authority.
SITUATION
The infrastructure team had standardised on a specific vector database (a commercial managed service) for all AI projects. I believed Qdrant, self-hosted on GKE, would reduce our costs by 60% at our document volume with no meaningful quality trade-off. I had no authority over infrastructure decisions.
ACTION
I ran a time-boxed, two-day benchmark: deployed Qdrant on a GKE cluster, re-indexed our corpus, and ran identical retrieval quality tests against both options. I tracked recall@5, P95 latency, and the monthly cost projection at 10M vectors. I presented the results at the next architecture review with a risk analysis: what could go wrong with self-hosting and how we'd mitigate it (backup procedures, upgrade path, GKE auto-healing).
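A sketch of the recall@5 side of that benchmark, assuming the qdrant-client API and a labelled query set:

```python
from qdrant_client import QdrantClient

client = QdrantClient(url="http://qdrant.internal:6333")  # placeholder endpoint


def recall_at_k(labelled_queries: list[tuple[list[float], set[str]]],
                collection: str = "compliance_docs", k: int = 5) -> float:
    """labelled_queries: (query_vector, ids_of_known_relevant_chunks) pairs."""
    hits = 0
    for vector, relevant_ids in labelled_queries:
        results = client.search(collection_name=collection,
                                query_vector=vector, limit=k)
        hits += any(str(point.id) in relevant_ids for point in results)
    return hits / len(labelled_queries)
```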
RESULT
The infrastructure team adopted Qdrant for our project after reviewing the benchmark. The principle I follow: data beats opinion. I don't argue about tool choices; I run experiments and present numbers.
Tell me about a time you improved a process or system proactively.
SITUATION
Our RAG pipeline had no automated quality monitoring; we only found out about quality regressions when users complained. With monthly model deployments and frequent prompt iterations, regressions were a constant risk.
ACTION
I built a lightweight eval harness: 100 fixed query-context-answer triples from production with human-verified correct answers. Integrated RAGAS faithfulness and answer relevance metrics. The harness runs as a GitHub Actions step on every PR, with a fail threshold of a faithfulness drop > 5% vs the main branch. Added a Slack notification when scores approach the threshold so we can investigate before it triggers.
RESULT
Over the 3 months after implementation, the harness caught 4 regressions before they reached production: 3 from prompt changes, 1 from a chunking parameter change. None of them reached users. The team adopted the eval harness as standard practice across all AI projects.
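The harness core, sketched with the RAGAS API (current releases name the second metric answer_relevancy rather than answer_relevance); dataset columns follow the standard RAGAS schema, and the metrics need an LLM backend configured:

```python
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import answer_relevancy, faithfulness


def score_golden_set(triples: list[dict]) -> dict:
    """triples: dicts with 'question', 'contexts' (list[str]), and 'answer' keys,
    taken from the 100-example golden set described above."""
    dataset = Dataset.from_list(triples)
    result = evaluate(dataset, metrics=[faithfulness, answer_relevancy])
    return {"faithfulness": result["faithfulness"],
            "answer_relevancy": result["answer_relevancy"]}
```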
Question Banks
GenAI (79 questions): Fundamentals · RAG / Retrieval · Agents / LangChain · OCR / Doc AI · Chatbots · Deployment / MLOps · Observability / Eval
Python (47 questions): Core Python · Internals · OOP / Patterns · DSA · Concurrency / Async · Memory / Performance
Web & APIs (6 questions): Flask / FastAPI · REST / API Design · Auth / Security · Streaming / WS · Background Workers
Data Stores (20 questions): SQL / PostgreSQL · NoSQL · Redis / Cache · Vector DBs · Optimization
DevOps (13 questions): Docker · Kubernetes · CI/CD · GCP / AWS · IaC / Config
Architecture (34 questions): System Design · Distributed Systems · Event-Driven · HA / Fault Tolerance · Scaling Strategies
✅ Strong Alignments (JD requirement: your evidence · strength)
AI chatbots & conversational agents: AR Email Agent, LangGraph multi-agent workflows · ●●●●●
RAG-based architectures: Long-context RAG, 65% time reduction, reranking, HyDE · ●●●●●
OCR pipelines: pdfplumber + pytesseract + OpenCV, 96% accuracy · ●●●●●
Python AI/ML frameworks: LangChain, HuggingFace, PyTorch, Scikit-Learn, spaCy · ●●●●●
GCP deployment & scaling: Docker + GKE + GitHub Actions + GCP Associate cert · ●●●●○
RESTful APIs: FastAPI async endpoints, Spring Boot, Node.js · ●●●●○
CI/CD & MLOps: GitHub Actions with eval gates, AWS DevOps cert · ●●●●●
⚠️ Gap: IBM Watson
Suggested answer: "I haven't worked with Watson directly. My production experience is with Azure OpenAI and open-source LLMs. The core concepts (intent classification, entity extraction, dialog management) are transferable across platforms, and I've implemented them from scratch with LangGraph. I'm confident I can pick up Watson's tooling quickly given the depth of my LLM architecture experience."
🎯 JD Practice Bank
Use this bank to rehearse the role-specific must-haves after you review the alignment summary above.
πŸ” Profile Gap Assessment
IBM Watson / Dialogflow
Gap: No direct Watson or Dialogflow experience.
Strategy: Frame it as "same problem, different tools." Emphasise that your custom LangGraph implementation covers the same patterns (intent routing, entity extraction, dialog state management) at a lower abstraction level, demonstrating deeper understanding.
Formal ML Research / Publications
Gap: Applied engineering background, not ML research.
Strategy: Emphasise production-grade engineering as a differentiator: you build systems that work at scale, not just on benchmarks. Frame applied experience as the harder, more valuable skill for product roles.
Team Leadership / Direct Reports
Gap: No formal people management experience.
Strategy: Highlight technical leadership: drove architecture decisions, mentored junior developers on LangChain patterns, led design reviews, produced technical documentation adopted team-wide.
Strengths That Differentiate
✓ LangGraph multi-agent experience (rare in the market)
✓ Production-grade OCR + LLM pipeline with quantified accuracy
✓ Full-stack AI deployment, from model to monitoring
✓ Both GCP and AWS certified: cloud-agnostic
✓ Fine-tuning with LoRA/QLoRA (LoRA rank, hyperparameters, eval)