A
AION2h ago
Career Pages

Forward Deployed ML Engineer

Bengaluru, Karnataka, India
Full Time
Mid Level

Auto Apply to 50+ AI Matched Forward Deployed ML Engineer Jobs

Use Auto Apply Agents to Bulk Apply jobs with ATS Optimised Resumes, find verified Insider Connections for jobs at AION

Responsibilities

Qualifications & Requirements

Experience Level: Mid Level

Full Job Description

AION is seeking a Forward Deployed ML Engineer to join our team in Bengaluru, Karnataka, India. In this role, you will act as a hands-on AI engineer, similar to an AI startup CTO, with 3-5+ years of experience in building production-grade multimodal AI systems and LLM applications. You will work within small, agile teams to deliver critical customer projects, embedding directly at client sites. Your responsibilities will include architecting, building, and deploying intelligent agent solutions, translating ambiguous business requirements into impactful technical solutions, and managing the full AI deployment lifecycle from use case discovery to production optimization. You will be comfortable writing production code, presenting to C-level executives, and debugging complex AI systems in real-world environments. Experience with voice agents, video processing systems, conversational AI, RAG systems, and LLM orchestration frameworks is highly desirable. Exceptional communication, customer empathy, and a drive to build transformative AI solutions are essential.

Customer Engagement & Multimodal Agent Development

Engage directly with customers at their sites to conduct discovery workshops and technical assessments, identifying high-impact AI opportunities. Design and architect end-to-end multimodal agent systems (voice, video, text) leveraging AION's distributed GPU infrastructure. Build production-grade voice AI systems using STT, TTS APIs, and LLMs. Develop vision-enabled agents processing real-time video streams using computer vision pipelines. Implement multi-agent orchestration with frameworks like LangChain or LlamaIndex for tool use, memory management, and autonomous task completion. Rapidly prototype POCs, validate concepts, and iterate based on feedback. Optimize for sub-500ms latency, natural conversation flow, and real-time system responsiveness. Integrate agents into customer codebases via REST/GraphQL/WebSocket APIs and custom SDKs.

Serve as a trusted technical advisor to customers, shaping their AI strategy and guiding roadmap decisions.

Data Strategy & MLOps Infrastructure

Design data architectures with efficient processing pipelines and ingestion workflows for training and inference on AION's platform. Implement RAG systems with vector databases, optimizing embedding strategies, chunk sizes, and retrieval methods. Prepare and validate datasets for fine-tuning, evaluation, and synthetic data generation. Collaborate with MLEs, MLOps, and SREs for model deployment and productionization.

Observability, Evaluation & Production Operations

Implement LLM and agent observability and monitoring, tracking key metrics like token usage, latency, costs, and quality. Instrument applications to trace LLM calls, retrieval operations, agent actions, and data flows. Build evaluation frameworks with offline benchmarks and online monitoring to ensure system performance and identify drift.

Technical Skills & Experience

We encourage you to apply if you meet some of these requirements and are eager to learn the rest:

  • 3-5+ years of hands-on experience building production AI/ML systems, with 1-2+ years deploying LLM applications.
  • Multimodal AI expertise: practical experience with voice agents, vision systems, or conversational AI.
  • Strong LLM foundations: hands-on with foundation models, fine-tuning, prompt engineering, and evaluation.
  • Agent framework proficiency: production experience with LangChain, LlamaIndex, or similar.
  • Voice AI platform experience: built real-time conversational systems with STT/TTS integration.
  • Proficiency in Python (production-grade, async, type hints) and JavaScript/TypeScript (full-stack).
  • RAG implementation experience: built retrieval-augmented generation systems with vector databases.
  • MLOps & deployment: hands-on with Docker, Kubernetes, CI/CD, and IaC.
  • Cloud platforms: experience with AWS, Azure, or GCP for ML workloads.
  • Exceptional communication skills for technical and business stakeholders.
  • Customer-facing experience (Solutions Architecture, TAM, Pre-Sales) is highly desirable.
  • Computer vision experience (video processing, object detection, VLM) is a plus.
  • Model fine-tuning experience (LoRA/QLoRA, SFT, RLHF) is a plus.
  • Inference optimization experience (vLLM, TensorRT-LLM, Triton, quantization) is desirable.
  • Observability tooling experience for LLM monitoring, tracing, and evaluation is a strong plus.
  • Familiarity with WebRTC, real-time streaming, and low-latency media processing.

Why Join AION?

  • Work directly with founders shaping technical and product strategy.
  • Build infrastructure powering the future of AI compute globally.
  • Significant ownership and impact with competitive equity.
  • Competitive compensation, flexible work options, and wellness benefits.

Apply now by sharing your resume highlighting relevant projects and leadership experience, links to your work (GitHub, demos), and a brief note on why AION's mission excites you.

Company

A

AION

AION is pioneering a decentralized AI cloud platform focused on high-performance computing (HPC). Our mission is to democratize access to compute power and provide managed services, creating an end-to...

Bengaluru, Karnataka, India
Posted on Career Pages
Forward Deployed ML Engineer, Agents at AION | Bengaluru, Karnataka, India | Apply Now | MindMyJob | MindMyJob - AI Job Search Platform