Sr. Fullstack Developer Lead (NVIDIA & Kubernetes)
Everforce Software Pvt. Ltd. · Santa Clara, CA, United States
Santa Clara, CA, United States · Exp: 10+ yrs · Onsite
Remuneration
Not specified
Location
Santa Clara, CA, United States
Visa sponsorship
Not specified
Job summary
Seeking an experienced Full-stack Development Lead with expertise in the NVIDIA GPU stack, Kubernetes, and AI Ops deployment. The role involves leading the deployment, automation, and optimization of scalable AI/ML solutions across cloud and on-premise environments. This is an onsite position in Santa Clara, CA, requiring 10+ years of experience.
Qualifications
- Strong hands-on experience with Kubernetes and AI Ops development
- Expertise in the NVIDIA GPU stack, including CUDA, TensorRT, and Triton Inference Server
- Experience with AI infrastructure, GPU computing, and AI deployment pipelines
- Strong experience with Azure Cloud and/or Google Cloud Platform (GCP)
- Experience deploying GPU-enabled Kubernetes clusters in cloud and on-premise environments
- Strong knowledge of containerization technologies, including Docker and Helm
- Experience with CI/CD pipelines, DevOps practices, and infrastructure automation
- Strong understanding of microservices and cloud-native applications
- Experience with Python scripting and automation
- Knowledge of MLOps frameworks and AI/ML infrastructure management
- Experience in performance optimization, troubleshooting, and distributed systems
- Strong understanding of scalable architecture and platform reliability
- Excellent technical leadership and collaboration skills
- Experience with large-scale AI/ML production environments
- Familiarity with Infrastructure as Code tools such as Terraform
- Experience with enterprise AI platform modernization initiatives
- Strong background in cloud-native architecture and automation
Responsibilities
- Design, deploy, and manage scalable Kubernetes-based AI infrastructure platforms
- Package and deploy AI/ML applications using Kubernetes and container technologies
- Create, test, and maintain Kubernetes clusters across cloud and on-premise environments
- Build and optimize GPU-enabled AI environments using NVIDIA technologies
- Collaborate with DevOps and engineering teams to implement AI deployment pipelines and cloud-native solutions
- Develop and maintain CI/CD pipelines for automated deployments and infrastructure provisioning
- Support AI Ops, MLOps, and cloud modernization initiatives
- Monitor, troubleshoot, and optimize distributed systems and platform performance
- Ensure scalable, secure, and high-performing AI infrastructure architectures
Skills
Azure, Docker, GCP, Helm, Kubernetes, Python, Terraform
Contract length
6–12+ Months
Relocation
No