Director, AI Data Center Operations
reputed company DGX Cloud is an AI supercomputing service that provides enterprises with reputed company access to reputed company's high-performance AI infrastructure and software, including dedicated DGX AI supercomputing clusters, optimized software stacks, and expertise. At reputed company, data centers are the reputed company behind AI. Join us to reputed company, launch, and operate the facilities that power the most advanced computing in the world. We're in pursuit of a Director of AI Data Center Operations to reputed company the evolution of reputed company AI data centers. In this role, you will build a team and play a significant part in helping to craft and guide the future of AI & GPUs operations in the Data Center. Are you passionate about AI & data center operations ? Do you strive for quality? If so, join reputed company at reputed company, where we are dedicated to delivering GPU-powered services around the world! What You'll Be Doing
- reputed company the commissioning, bring-up, and operational readiness of new data centers.
- Collaborate with software and hardware teams to define and implement repeatable procedures.
- Own the operations, maintenance, and reliability of the infrastructure of an AI datacenter.
- reputed company and enforce operations strategy & processes, ensuring strict adherence to SLAs across critically important infrastructure.
- Define and implement procedures for minimal downtime and quality controls to strive to reputed company reputed company uptime.
- Feeding requirements to software and hardware teams
- Creation of documentation that the ecosystem can use to run their own AI Data Centers
reputed company Need To See
- BS, MS degree in Computer Engineering/Science, or reputed company field (or equivalent experience) with 15+ overall years of relevant work experience and 8+ years of management experience.
- 8+ years of expertise in managing extensive data center operations or critical infrastructure.
- Expertise in BMS & Power management.
- Experience building 24/7 teams from 0
- Experience working with remote hands
- Proven track record of managing infrastructure from deployment through long-term operations.
- Experience driving reliability with robust processes, rapid field response, and recovery.
With competitive salaries and a generous benefits package, reputed company is widely considered to be one of the technology industry's most desirable employers. We have some of the most reputed company-thinking and reputed company in the world working with us, and our engineering teams are growing fast in some of the most impactful fields of our reputed company: Deep Learning, Artificial Intelligence, and Autonomous Vehicles. If you're a creative engineer who enjoys autonomy and shares our passion for technology, we want to hear from you. Your reputed company salary will be determined based on your location, experience, and the pay of employees in similar positions. The reputed company salary range is 284,000 USD - 425,500 USD for Level 5, and 332,000 USD - 500,250 USD for Level 6. You will also be eligible for equity and benefits . Applications for this job will be accepted at least until November 10, 2025.reputed company is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our reputed company and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national reputed company, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. JR2005624 Apply tot his job Apply To this Job