Home
Jobs
Saved
Resumes
Senior MLOps Engineer at Addvisor Group | JobVerse
JobVerse
Home
Jobs
Recruiters
Companies
Pricing
Blog
Jobs
/
Senior MLOps Engineer
Addvisor Group
Remote
Website
LinkedIn
Senior MLOps Engineer
Brazil
Full Time
1 week ago
No Sponsorship
Apply Now
Key skills
AWS
Azure
Google Cloud Platform
Grafana
Prometheus
AI
ML
GenAI
LLM
MLOps
GCP
Google Cloud
Azure Monitor
Datadog
About this role
Role Overview
Execute and follow established Standard Operating Procedures (SOPs) for GenAI and agent-based solutions in production
Monitor platform health, model performance, and inference pipelines
Ensure stability and availability of AI services across all environments
Investigate and resolve incidents by analyzing logs, traces, and metrics
Conduct root cause analysis (RCA) and document findings
Use observability tools (logs, metrics, tracing) to detect anomalies and performance issues
Contribute to the evolution of Standard Operating Procedures (SOPs) and runbooks
Support runtime operations of LLM-based applications and agent-driven workflows
Monitor inference performance (latency, throughput, cost)
Requirements
Experience with MLOps, ML systems, or AI platform operations
Strong troubleshooting skills using logs and observability tools
Familiarity with cloud environments (e.g., Azure, AWS, GCP)
Understanding of ML pipelines, APIs, and distributed systems
Experience with monitoring tools (e.g., Datadog, Prometheus, Grafana, Azure Monitor)
Tech Stack
AWS
Azure
Google Cloud Platform
Grafana
Prometheus
Benefits
Health insurance
Flexible working hours
Professional development opportunities
Apply Now
Home
Jobs
Saved
Resumes