Cloud GCP or AWS or Azure (LLM hosting, GPU-based inference, cost optimization)
Large-scale AI projects leveraging LLMs (e.g., Llama, GPT, Claude, Mistral)
RAG, Agentic RAG, AI Agents, Vector DBs (e.g., FAISS, Pinecone, Weaviate, ChromaDB)
LLM-based fine-tuning techniques, Low-Rank Adaptation (LoRA), Quantization (AWQ, GPTQ, FP8, INT4)
Multi-GPU parallelization, model pruning, and knowledge distillation
Governance frameworks (e.g., AI Ethics, Explainability, Risk Mitigation)
+36 more