Implement Sophisticated AI Agents: Design, build, and deploy complex AI agents using LangChain and LangGraph
Master Prompt & Context Engineering: Design, test, and refine complex prompts and contextual data frameworks to ensure our AI agents perform with maximum accuracy, efficiency, and reliability
Lead AI Research & Innovation: Stay at the bleeding edge of AI
Build for Production Scale on GCP: Engineer and operate our AI systems in a scalable, reliable production environment on Google Cloud Platform
Champion MLOps for Agentic Systems: Establish and lead best practices for the reliability, versioning, monitoring, and observability of our AI agents
Collaborate to Deliver Impact: Partner closely with product leaders, data scientists, and other engineers to translate business needs into technical reality
Champion modern software development practices by actively using AI code-assist tools to accelerate development cycles
Build, manage, and mentor a cross-functional team of software, quality, and reliability engineers, fostering a culture of technical excellence and continuous improvement
Define and report on key engineering metrics (SLA, SLO, SLI) and ensure compliance with security, quality, and financial operations best practices
Collaborate with product managers, architects, SREs, and business partners to define technical strategy and create software roadmaps
Lead troubleshooting efforts to resolve production and customer issues
Requirements
Bachelor's degree or equivalent experience
7+ years in software engineering, with a strong track record of technical leadership and shipping complex, scalable systems
Experience in a dedicated AI/ML role, with hands-on experience in model integration, MLOps, and applying AI to solve business problems
Direct experience architecting and building solutions with LangChain, LangGraph, or similar agentic AI frameworks
In-depth experience with Google Cloud Platform (GCP), specifically its AI/ML services (Vertex AI, etc.)
3+ years of proven experience running workloads on Kubernetes
Proficiency in Python, JavaScript/TypeScript, and/or Java, plus working knowledge of a modern front-end framework (Angular, React, or Vue) to collaborate effectively with UI teams
Hands-on experience with LLM observability tools like Langfuse for monitoring and debugging agentic workflows
Tech Stack
Angular
Cloud
Google Cloud Platform
Java
JavaScript
Kubernetes
Python
React
TypeScript
Vue.js
Benefits
Comprehensive compensation and healthcare packages
401k matching
Paid time off
Organizational growth potential through our online learning platform with guided career tracks