Act as the technical authority for SONiC deployments running on Cisco Silicon One platforms in large-scale production environments.
Provide architectural guidance and operational support for hyperscale data center fabrics supporting AI/ML workloads.
Troubleshoot complex issues across the stack including SONiC containers, Linux networking, routing protocols, and ASIC forwarding behavior.
Perform deep debugging and analysis using Silicon One SDK tools and ASIC telemetry to identify hardware and software interactions.
Support both production and pre-production validation environments running SONiC and Cisco platforms.
Collaborate directly with Cisco silicon architects, SONiC development teams, and software engineering groups to resolve defects and improve platform capabilities.
Optimize network fabrics designed for AI/ML clusters.
Guide the customer in adopting DevOps-driven operational models, including CI/CD pipelines for SONiC software validation and infrastructure automation.
Help implement observability, telemetry, and automated validation frameworks to improve reliability at hyperscale.
Act as a trusted technical advisor, building strong relationships with customer network engineering and infrastructure teams.
Requirements
Bachelor’s degree in Computer Science, Electrical Engineering, or related field (or equivalent experience)
10+ years of experience in networking, distributed systems, or hyperscale infrastructure
Deep knowledge of data center networking architectures and large-scale routing environments
Strong troubleshooting skills across software, hardware, and network protocols
Experience working in large production environments with minimal supervision
Deep hands-on experience with SONiC architecture and operations
Understanding of SAI, Redis DB model, containerized services, and control-plane architecture
Strong knowledge of Linux networking stack and kernel internals
Experience debugging network behavior using Linux tools and system-level diagnostics
Familiarity with Docker or Kubernetes-based service orchestration
Experience working with merchant silicon or programmable ASIC platforms
Exposure to Cisco Silicon One architecture or similar programmable switching silicon
Ability to analyze packet forwarding pipelines and hardware counters
Experience supporting AI cluster networking or HPC fabrics
Knowledge of RoCE networking, congestion management, and performance optimization
Strong automation experience with Python and Linux tooling
Familiarity with open-source development workflows and Git-based collaboration
Strong working knowledge of networking protocols such as BGP, ISIS, MPLS and Segment Routing, VXLAN / EVPN, IPv6
Tech Stack
Distributed Systems
Docker
Kubernetes
Linux
Python
Redis
Switching
Benefits
medical, dental and vision insurance
401(k) plan with a Cisco matching contribution
paid parental leave
short and long-term disability coverage
basic life insurance
10 paid holidays per full calendar year
1 floating holiday for non-exempt employees
1 paid day off for employee’s birthday
paid year-end holiday shutdown
4 paid days off for personal wellness
16 days of paid vacation time per full calendar year
flexible vacation time off program
80 hours of sick time off provided on hire date
up to 80 hours of unused sick time carried forward from one calendar year to the next
optional 10 paid days per full calendar year to volunteer