Continue to develop, productize and maintain a business critical internal application for HPC/AI cluster design and simulation having dependencies on other internal critical applications.
Use AI-assisted development tools (Claude, Gemini, etc.) to accelerate coding, prototyping, and problem-solving
Participate in the full software lifecycle: design, implementation, testing, deployment, and maintenance
Collaborate with HPC, infrastructure, and AI stakeholders to translate requirements into technical solutions
Ensure performance, scalability, and reliability of the application
Improve and automate development workflows using CI/CD best practices
Contribute to code reviews, documentation, and knowledge sharing
Continuously explore new tools, frameworks, and approaches to enhance productivity and product quality while maintaining production grade standards.
Requirements
Strong experience in software development in a Linux environment
Familiarity with AI-assisted coding tools (e.g., Claude, Gemini, Copilot) and modern development practices
Experience building and maintaining web applications or internal tooling
Understanding of HPC, distributed systems, or AI infrastructure (clusters, GPUs, networking, scheduling) is highly desirable
Knowledge of:
o APIs and backend development
o Frontend frameworks (React, Vue, or equivalent)
o Databases (SQL/NoSQL)
o Version control (Git)
Experience with CI/CD pipelines, automation, and DevOps practices using AI-driven tooling.
Ability to troubleshoot, debug, and optimize complex systems
Tech Stack
Distributed Systems
Linux
NoSQL
React
SQL
Vue.js
Benefits
Work on cutting-edge AI-assisted development workflows
Contribute to a strategic tool for HPC/AI infrastructure design
Be part of a culture that values innovation, learning, and collaboration
Opportunity to grow your expertise in both software engineering and AI/HPC domains