Back

Linux HPC Engineer

Worldwide Salaried Open

RedLine Performance Solutions (RedLine) has been in the High Performance Computing (HPC) solutions engineering services business for over 26 years and is consistently determined to keep the "bar of excellence" quite high for new hires. This enables RedLine to accomplish what other firms cannot and promotes a high level of staff retention. RedLine provides IT infrastructure management and technical support services to some of the world’s largest supercomputing sites. The Linux/HPC Engineer will primarily work on a small team of HPC Systems Administrators responsible for the installation and operational support of an HPC cluster located in Phoenix, Arizona. Operations run 24x7 and therefore there will be a rotational on-call requirement. The Linux/HPC Engineer will actively participate in the evolution and maintenance of the technical infrastructure, in addition to supporting the on-site HPC environment. The position can be remote, but will be required to support the normal business hours for the primary customer site in Phoenix, AZ. In addition to supporting the HPC cluster in Phoenix, the Engineer will also contribute to other infrastructure and customer initiatives as business needs arise. The Engineer will be required to shift priorities, support parallel efforts, and provide technical expertise across multiple projects, including deployments, upgrades, troubleshooting, and documentation. Additional assignments may include short-term tasking in adjacent programs, collaboration with cross-functional engineering teams, and participation in planned maintenance windows or special projects to meet organizational commitments. Travel to different customer sites is expected to be a maximum of 25% of the time. US citizenship is a mandatory requirement for this position. This full-time (W-2) position offers a full benefits package including paid time off, 401k match, and health care benefits. Required Skills:

  • 5 or more years of Linux systems administration, preferably in a Red Hat and/or Rocky environment
  • Strong knowledge of TCP/IP networking
  • HPC system administration experience (e.g., parallel file systems, cluster management, archival systems)
  • Strong experience in Bash, Perl, and Python scripting in a version-controlled environment using Git
  • Strong verbal and written communication skills, with the ability to coordinate between multiple team members in remote locations between several disparate projects
  • Strong organizational skills

Preferred Skills/Experience:

  • Experienced with system engineering in addition to system administration
  • Cloud administration (e.g. Azure, GCP, AWS)
  • Experience with deploying and supporting computational models and simulations in HPC infrastructure (e.g., on-premise and cloud, with containers).
  • Knowledge and understanding of application hosting, with experience using Cloud Services in a Commercial Infrastructure as a Service (IAAS) or Platform as Service (PAAS) environment.
  • Red Hat Certification (e.g., RHCSA, RHCE)
  • Server automation experience (e.g., Puppet, Foreman, Ansible)
  • Experience with job scheduling software (e.g., Slurm or Moab)
  • Experience with cluster automation tools (e.g., xCAT, HPCM, or Bright Cluster Manager)
  • Familiarity with a wide range of server and networking hardware (e.g., HPE, SuperMicro, NetGate, Juniper, etc.)
  • Applications such as Atlassian Confluence, Gitlab, or Mediawiki

To learn more about RedLine, please visit our website at www.RedLinePerf.com Apply tot his job Apply To this Job

More jobs

Senior Systems Administrator – Power Platform, Azure

Worldwide Salaried

Windows/VMware Administrator

Worldwide Salaried

IBM Power Systems Administrator/Engineer (IBM i and AIX) ---100% REMOTE

Worldwide Salaried

Windows / Linux Systems Administrator - REMOTE (US Citizenship required)

Worldwide Salaried

Sr System Administrator

Worldwide Salaried

Sys Admin Windows/UNIX (RHEL) - Remote

Worldwide Salaried

Systems Administrator (Remote)

Worldwide Salaried

Senior Network and Computer Systems Administrator job at Cayuse Software in US National

Worldwide Salaried

Credentialing Systems Administrator - Remote - Long Term Contract

Worldwide Salaried

Systems Administrator job at Massachusetts Institute of Technology - MIT in Lexington, MA

Worldwide Salaried

Digital Marketing Manager

Worldwide Salaried

Remote Customer Service Agent – Work‑From‑Home, Sales & Support Specialist for arenaflex

Worldwide Salaried

Experienced Data Entry Specialist – Remote Opportunity with arenaflex

Worldwide Salaried

Vice President, Deputy General Counsel - Commercial Compliance

Worldwide Salaried

Remote Customer Support Specialist - Exceptional Airline Passenger Experience Representative

Worldwide Salaried

Data Center Power and Cooling Principal Consultant

Worldwide Salaried

Experienced Data Entry Specialist – Remote Opportunity with arenaflex

Worldwide Salaried

Sales Manager

Worldwide Salaried

Especialista Soporte Técnico Cardioarritmias (Dispositivos médicos) Guadalajara

Worldwide Salaried

Experienced Customer Sales Manager – Northeast Region

Worldwide Salaried