
Job Title : Technical Support Specialist
Location: Milpitas, CA (Onsite)
Duration: 3 months+
Role Overview
This role is a hands-on, hardwarefocused technical support position supporting GPU/compute clusters in an AI lab/R&D environment. The emphasis is on hardware troubleshooting, Linux-based system support, and deep understanding of compute architecture, rather than software development.
Key Responsibilities
Troubleshoot GPU/CPU servers, compute clusters, and networking (InfiniBand)
Diagnose hardware issues (cabling, components, GPUs, servers)
Rack/stack initially limited (systems already built), but may increase if extended
Replace/install server components within racks
Use Linux command line extensively for diagnostics and system validation
Manage lab space and hardware inventory (reprocurement access provided)
MustHave Skills (NonNegotiable)
Strong hardware troubleshooting experience (servers, GPUs, compute systems)
Solid understanding of computer/compute architecture
Strong Linux skills for system bringup and troubleshooting
Experience with GPUs and highperformance compute environments
Ability to independently diagnose and resolve hardware/system issues
Preferred / NicetoHave
Prior data center or HPC/compute cluster experience (plus, not mandatory)
Scripting experience (Bash, Python) expected if candidate has done similar roles
Familiarity with GPU technologies (cuttingedge R&D GPUs; Tesla, etc.)
Candidates who ve built systems themselves (gaming PCs, lab servers, small data centers)
Experience & Education
Minimum: 3 4 years of relevant experience (not pure sysadmin only)
Bachelor s degree preferred, but experience matters more than degree
No travel required