Dexmate is seeking a Senior AI Engineer to design, build, and ship AI-powered software across the full stack. The role involves owning AI features end-to-end, from system design through implementation and production monitoring, while ensuring the reliability and effectiveness of AI agents in a production environment.
Responsibilities:
- Design, implement, and deploy production-grade AI agents: multi-step reasoning pipelines, tool-calling workflows, multi-agent coordination, and human-in-the-loop handoffs
- Design and build agent harnesses — the runtime infrastructure (context management, tool definitions, memory, feedback loops, observability, and lifecycle control) that makes agents reliable in production; the model is a component, the harness is the product
- Engineer context pipelines: dynamic retrieval, re-ranking, semantic search, and GraphRAG as tools within an agentic reasoning loop — not static RAG pipelines; understand when to retrieve, when to use long context, and when to use agent memory
- Implement production-grade reliability: retry logic with backoff, cost controls, structured output validation, sandboxed tool execution, and checkpoint-resume for long-running agent workflows
- Develop systematic evaluation frameworks (evals, golden datasets, regression suites, observability traces) that measure agent quality and catch regressions before production
- Architect and implement scalable backend services and APIs (REST/GraphQL) in Go, Rust, or TypeScript/Node.js
- Build and maintain integrations with external systems — databases, internal APIs, robot data streams — enabling agents to take real actions with appropriate access controls
- Own deployment, monitoring, and observability: Docker, Kubernetes, CI/CD pipelines, and LLM-specific tracing and cost tracking
- Build clean, functional web interfaces in React/Next.js — operator dashboards for robot fleet management, engineering tooling for the AI team, and customer-facing applications
- Own features end-to-end: product requirements, implementation, testing, rollout, and ongoing maintenance
- Treat prompt engineering as a first-class engineering discipline: write, test, and version prompts with the same rigor as application code