Senior HPC Systems Administrator
The High-Performance Computing Center (HPCC) at the University of California, Riverside (UCR) has an opening for a Senior HPC Systems Administrator. In...
The High-Performance Computing Center (HPCC) at the University of California, Riverside (UCR) has an opening for a Senior HPC Systems Administrator. In this exciting leadership position, you will manage state-of-the-art research computing infrastructure in support of the science conducted by researchers at UCR. The Senior HPC Administrator provides technical leadership for UCR's largest high-performance computing (HPC) infrastructure, manages a complex portfolio of responsibilities at a campus-wide level, and advises upper administration on strategic decisions in research computing. The HPCC enables cutting-edge research in a wide range of science, engineering and biomedical disciplines by providing the computing hardware, software and expertise to enable pioneering discoveries. UCR is a vibrant research and teaching university with a diverse student, staff and faculty body located in beautiful Southern California. UCR is an equal opportunity employer that values and respects the importance of a diverse and inclusive workforce. In this position you won’t work alone, instead you will be part of a creative, dynamic work environment where you will collaborate with supportive colleagues.
RESPONSIBILITIES
○ Support, maintain, enhance, and expand the Linux-based HPC cluster consisting of hundreds of physical CPU/GPU nodes with thousands of cores, a multi-petabyte parallel big data storage system with backup and a high-speed internal network.
○ Supervise HPC facility staff.
○ Monitor, optimize and troubleshoot performance and functionality of the infrastructure.
○ Manage security of all HPC, networking and storage components in accordance with university policy and best practices.
○ Install, maintain, and troubleshoot research and general HPC environment software.
○ Automate and document processes throughout the HPC infrastructure including upgrades, software installs, and deployments of new hardware and services.
○ Develop and publish user and technical documentation on the use of systems. Directly support researchers, course instructors, and students to enhance success within the HPC environment. Participate in training sessions instructing users best practices for running research applications on HPC systems and managing big data storage.
MINIMUM QUALIFICATIONS
○ Bachelor’s degree in a computational field, followed by 6 years of post-baccalaureate work experience, which includes at least 3 years of Linux and/or HPC administration in a professional environment, or an equivalent combination of education and experience.
○ Excellent team and outreach abilities to network and collaborate with key contacts outside their own area of expertise.
○ Fluency in two or more programming languages and environments used in research computing such as Bash, Python, C/C++, R, Java, Tensorflow, PyTorch, Jupyter Notebooks, Rstudio Server, and Matlab.
○ Commitment to lifelong learning.
ADDITIONAL DESIRED QUALIFICATIONS
○ Experience supervising a team of computational experts.
○ Experience configuring and fine-tuning job schedulers and resource managers (Slurm, PBS, etc.).
○ Experience with parallel programming and computing on Linux clusters using C/C++, Fortran, Python, MPI, OpenMP, multithreading and multicore technologies on CPU and GPU architectures.
OTHER POSITION DETAILS
Offer will be based on the successful candidate's education and related experience.
Some of the job duties can be performed remotely with some in-person requirements.
Below are some other jobs we think you might be interested in.
-
Systems Administrator
- City of Chicago Office of Inspector General
- Chicago, IL, USA
May 23 -
Senior IT Project Manager
- Pennsylvania Turnpike Commission
- Middletown, PA, United States
- Hybrid
May 09 -
Senior Cyber Security Analyst
- Valley Water
- San Jose, CA, USA
May 13 -
Unified Technology Solutions Architect
- Pennsylvania Turnpike Commission
- Middletown, PA, United States
- Hybrid
May 16 -
Information Technology Director
- City of Tigard
- Tigard, OR, USA
May 08 -
Mobile/Web Developer (.Net)
- Air Line Pilots Association
- Atlanta, GA, USA
May 17 -
Lead Big Data Engineer (Team Lead) - Hybrid Denver, CO
- GridX
- Denver, CO, USA
- Hybrid
Jun 05 -
Director of Government Affairs, Institute for AI Policy and Strategy (IAPS)
- Institute for AI Policy and Strategy (IAPS)
- Washington, D.C., USA
- Hybrid
Jun 05