Junior HPC Systems Engineer
Excelerate
Bruyères-le-Châtel, Essonne
Are you passionate about High-Performance Computing (HPC) and eager to grow your career in this dynamic field?
Salary: €50,000
Location: Bruyères-le-Châtel, France
Employment Type: Permanent
A European leader in High-Performance Computing, is looking for a Junior HPC System Engineer to join our team. This is an exciting opportunity to contribute to world-class HPC solutions while working in an inclusive and supportive environment, alongside experts in the field. If you are curious, inventive, and daring, we would like to hear from you to help our client solve some of the most complex scientific challenges of today and tomorrow.
About the Role
As a Junior HPC System Engineer, you will be responsible for assisting in the administration of HPC systems, including supercomputers and storage solutions, to ensure they operate efficiently. This role will allow you to work hands-on with cutting-edge technologies and develop valuable skills in a collaborative, high-level team environment.
Key Responsibilities
- Assist in the administration of HPC systems, including supercomputers and associated storage systems.
- Install, configure, optimize, and maintain software across several thousand computing nodes.
- Support software maintenance operations and help prepare for updates and fixes.
- Contribute to implementing high availability solutions (HA, Pacemaker, Corosync).
- Develop automation procedures using scripts (Bash, Python).
- Assist in writing technical documentation and operating procedures (Wiki).
- Help diagnose and resolve production incidents, escalating when necessary.
- Support internal and partner teams in resolving customer tickets and improving services.
- Provide Level 1 and Level 2 support for a software stack based on CentOS.
- Participate in monitoring and handling technical escalations in cooperation with internal L2/L3 teams and partners.
Desired Skills & Experience
- Familiarity with or a desire to learn the administration of GNU/Linux HPC systems (RedHat, CentOS, or others).
- Knowledge of Lustre File System is a plus.
- Understanding of networking technologies (InterConnect, Infiniband, Ethernet, RoCE).
- Experience with containers (Docker, OpenStack) and orchestration tools (Puppet, Ansible).
- Basic scripting skills (Shell, Python, Perl).
- Ability to configure/modify key Linux services such as DNS, DHCP, Web, FTP, and authentication.
- Familiarity with monitoring tools like Nagios.
- Exposure to hardware such as network switches, X86 servers, and disk bays (DDN, ClusterStor).
- A foundational knowledge of C programming is advantageous for code analysis and compilations.
Languages:
- Technical operational English is required.
If you are ready to begin your journey in HPC and are eager to learn and contribute, apply today to join.