Indicium AI is a global AI-native consultancy that helps enterprises implement AI at scale. They are seeking an experienced AI Engineer to design, build, and deploy production-grade AI systems powered by large language models, focusing on building reliable, scalable applications.
Responsibilities:
- Design and implement production AI systems integrating LLMs, RAG pipelines, vector databases, and agentic frameworks
- Create evaluation frameworks to measure and monitor system performance, accuracy, and reliability
- Build and maintain production-grade AI applications with clean code, appropriate error handling, APIs, and data pipelines
- Experience implementing, maintaining and evaluating retrieval systems (vector/graph databases, ingestion pipelines, chunking strategies, retrieval techniques such as HyDE)
- Implement feedback loops and observability to continuously improve system performance
- Craft effective prompts and optimize for latency, cost, and quality across different model providers and configurations