HPC Storage Engineer – Bruyères-le-Châtel, Essonne

HPC Storage Engineer

Excelerate

Bruyères-le-Châtel, Essonne

Postuler

Are you a skilled HPC professional with a passion for storage systems and looking to advance your career in a leading tech company?

Salary: €60,000 – €70,000

Location: Bruyères-le-Châtel, France

Employment Type: Permanent

A European leader in High-Performance Computing, is looking for an HPC Storage Engineer to join their expert team. This is an exciting opportunity to work on world-class HPC clusters, ranked in the Top 500, and play a pivotal role in delivering high-performance solutions to solve the most complex scientific challenges. If you are ready to contribute your skills to cutting-edge technologies in an inclusive and dynamic environment, please read on.

About the Role

As an HPC Storage Engineer, you will focus on the administration and maintenance of HPC systems, with a particular emphasis on storage systems and parallel file systems. You'll be part of a collaborative team responsible for ensuring the operational efficiency of our HPC clusters and managing high-availability solutions and software maintenance.

Key Responsibilities

  • Administer HPC systems, including supercomputers and associated storage systems.
  • Install, configure, optimize, and maintain various File Systems in operational conditions.
  • Deploy, configure, and manage Parallel File Systems, particularly Lustre.
  • Implement high availability solutions (HA, Pacemaker, Corosync).
  • Develop automation procedures using scripts (Bash, Python).
  • Write technical documentation and operating procedures (Wiki).
  • Diagnose, analyze, and resolve production incidents.
  • Handle customer support tickets, resolving or escalating issues to internal or partner support teams.
  • Manage technical escalation files in coordination with internal L2 and L3 support teams or partners.
  • Provide Level 1 and Level 2 support for the software stack based on CentOS, including diagnostics, patch implementation, and escalation to L3 support.
  • Participate in an on-call rotation (approximately one week per month).

Desired Skills & Experience

  • At least 5 years of experience in Linux system administration, with a focus on HPC environments.
  • Expertise in the administration of GNU/Linux HPC systems (RedHat, CentOS, or others).
  • Strong knowledge of Lustre File System.
  • Familiarity with networking technologies (InterConnect, Infiniband, Ethernet, RoCE).
  • Experience with containerization technologies (Docker, OpenStack) and orchestration tools (Puppet, Ansible).
  • Scripting skills (Shell, Python, Perl).
  • Experience configuring/modify key Linux services (DNS, DHCP, Web, FTP, authentication, deployment management).
  • Familiarity with monitoring tools like Nagios.
  • Experience with hardware operation (network switches, X86 servers, disk bays such as DDN, ClusterStor, etc.).
  • Basic understanding of C programming for code analysis and compilations.

Languages:

  • Technical operational English is required (French is a plus).

If you're ready to contribute to the future of HPC storage systems and work on impactful projects, apply today.

Postuler

Voir tous les emplois