SilverSearch, Inc., a globally recognized media and information organization, is seeking a Senior Machine Learning Engineer to build and optimize large-scale ML inference systems. The role focuses on production-scale inference optimization and ML infrastructure, and it requires hands-on experience in a highly technical environment.
Responsibilities:
- Design, build, and optimize large-scale ML inference systems for text, image, and video workloads
- Scale semantic/vector search and embedding pipelines across millions of media assets
- Optimize inference latency, throughput, and cost efficiency for production ML systems
- Work with transformer-based NLP and computer vision models in production environments
- Improve and operationalize multimodal AI pipelines built on existing and open-source models
- Build scalable data processing systems across CPU/GPU cloud infrastructure
- Partner closely with Data Science and Platform teams to productionize ML workflows
- Contribute to hybrid search and retrieval systems that combine vector search with reranking
- Monitor and improve performance, reliability, and efficiency across distributed ML workloads