Overview:
We are seeking an experienced CAE HPC Specialist to manage, support, and optimize our on premises high-performance computing (HPC) infrastructure for the Virtual Engineering Group. The ideal candidate will have significant experience with Linux HPC systems and a deep understanding of CAE engineering software, including Ansys and Altair CFD, FEA and multi-physics applications.
The CAE HPC Specialist will be responsible for performance tuning, user support, license management, troubleshooting, automation and system configuration.
The CAE HPC Specialist will support a global engineering team across multiple locations, requiring strong expertise in remote system administration and software deployment.
Responsibilities:
Key Responsibilities:
-
HPC System Administration & Performance Optimization:
-
Lead HPC performance tuning
-
Ensure efficient job scheduling and resource allocation using Open PBS.
-
Optimize parallel processing and remote visualization workflows (VNC).
-
Monitor system health, usage, and troubleshoot hardware/software issues.
-
CAE Software Support & Troubleshooting (Global Team):
-
Troubleshoot job submission failures, simulation crashes, and visualization issues.
-
Assist users with best practices for simulation workflows and HPC utilization.
-
Maintaining software compatibility across updates and system upgrades
- Install and maintain CAE software on Workstations and ensure version alignment with HPC
-
License & Software Asset Management:
-
Assign and prioritize CAE license access for global users
- License tracking, usage reports, and cost-benefit analysis.
-
HPC Support & Ticketing Coordination:
-
Act as the primary contact for HPC-related support tickets.
-
Maintain an internal knowledge base for common troubleshooting issues.
Qualifications:
Qualifications & Skills:
-
10+ years of experience in CAE, CFD, FEA and HPC systems
- Strong expertise in Linux (Red Hat 8.7), HPC job scheduling (Open PBS), and remote visualization (VNC).
-
Hands-on experience with Ansys Fluent, Maxwell, Altair Hypermesh, OptiStruct, and Radioss.
-
Familiarity with license server management (FlexNet, LM-X, etc.).
-
Strong scripting skills (Bash, Python) for automation and troubleshooting.
-
Experience with performance tuning and parallel computing for CAE applications.
-
Excellent problem-solving, communication, and global user support skills.
-
Essential skills:
Preferred Qualifications:
-
Experience in automotive systems, thermal and electromotive systems and their simulation software.
-
Knowledge of containerization tools like Docker and Kubernetes.
-
Familiarity with AI/ML workflows in HPC environments.
-
Experience with benchmarking and performance analysis of HPC systems.
-
Familiarity with cloud-based HPC solutions and hybrid computing environments.
-
Knowledge of IT security policies, data encryption, and access control management.