Work both independently and collaboratively with other members of the archive team or group after receiving initial direction and requirements from technical project leads.
Troubleshoot system failures, diagnose their root causes, and isolate the affected components and failure scenarios while working with internal and external stakeholders.
Develop and publish updates on resolutions and communicate findings internally.
Work with team members to make modifications and additions to existing systems, code, and methods.
Work with team to bring up new hardware and test functionality.
Participate in process improvement, including deep multi-system problem isolation and resolution, often in collaboration with administrators of other HPC subsystems.
Work with team members to document, design, and implement new ideas and approaches for newer architectures, and to improve the approaches used for existing ones.
Present best practices, experience reports, and/or research results to managers and to peers locally or at conferences.
Scientist 3 ($98,900 - $165,100)
In addition to the duties outlined above, a successful Scientist 3 candidate will be required to:
Work as a technical leader/subject matter expert to propose and implement solutions to current problems and future deficiencies in our HPC archive storage environment in conjunction with junior and senior administrators and technical staff within and across teams.
Proactively create experiments and tooling to validate solutions and to detect and diagnose hardware health issues.
Analyze published research papers in the area of archive and data storage, summarize, and share implications and connections to ongoing work with team members.
Interact and/or collaborate with people from other teams, groups, divisions, directorates, and programs to develop, implement, and/or communicate technical solutions.
Enhance technical and professional expertise of other staff and students through active mentoring and training activities.
Contribute to peer review of the work of others across organizations or disciplines within the laboratory.
Present best practices and research results to national peers at conferences, workshops, and meetings, as well as participate in national strategic partnerships.
What You Need
Minimum Job Requirements:
Strong interpersonal, written, and oral communication skills.
Demonstrated ability to work within a team environment.
Demonstrated knowledge of building, configuring, and administering production Linux computer/storage systems.
Practical experience scripting in Bash, Perl, Python, or similar languages.
Ability to mentor and lead individual junior team members and students.
Broad knowledge of data storage administration.
Knowledge of storage system hardware.
Working knowledge of networking concepts and practices.
Knowledge of or experience with hardware and software security practices.
In addition to the Job Requirements outlined above, qualification at the Scientist 3 level requires:
Broad demonstrated knowledge of production HPC system management topics, including networking, programming, file systems, operating systems, and configuration management, with depth in one or more areas.
Demonstrated programming experience including compiled languages and advanced scripting.
Ability to lead and mentor teams, students, or junior team members.
Demonstrated ability to initiate, design, and lead projects.
Demonstrated ability to evaluate competing HPC subsystem technologies.
Ability to analyze published research papers in the area of data storage, summarize research results, and share implications and connections to ongoing work with team members.
Ability to present technical papers and/or technical work to peers locally or at conferences.
Desired Skills:
Experience deploying and managing SAN infrastructure.
Knowledge of parallel/distributed file systems (e.g., Lustre, GPFS, Panasas, GlusterFS).
Demonstrated experience building, configuring, and managing parallel or distributed file systems.
Knowledge of file systems such as ZFS, EXT, XFS.
Working knowledge of file system structures and algorithms.
Experience with object storage and RESTful storage interfaces.
Experience diagnosing system software problems.
Experience supporting a scientific user base.
Experience with multiple Linux distributions.
Experience with multiple network technologies (e.g., Ethernet, IB, OPA).
Experience with revision control systems such as RCS, Subversion, or Git.
Experience with low-level system administration tools such as perf, strace, tcpdump, and vmstat.
Experience managing computers in a DOE or DOD classified environment.
Familiarity with Cfengine, Chef, Puppet, Ansible, Salt, or similar configuration and automation tools and practices.
Deep knowledge of and demonstrated experience with parallel and distributed storage systems.
Contribution to open source or non-work-related projects.
Ability to acquire and maintain a DOE Q-level clearance.
Where You Will Work
Located in northern New Mexico, Los Alamos National Laboratory (LANL) is a multidisciplinary research institution engaged in strategic science on behalf of national security. LANL enhances national security by ensuring the safety and reliability of the U.S. nuclear stockpile, developing technologies to reduce threats from weapons of mass destruction, and solving problems related to energy, environment, infrastructure, health, and global security concerns.
Additional Details:
Clearance: Q (Position will be cleared to this level). Applicants selected will be subject to a Federal background investigation and must meet eligibility requirements* for access to classified matter.
*Eligibility requirements: To obtain a clearance, an individual must be at least 18 years of age; U.S. citizenship is required except in very limited circumstances. See DOE Order 472.2 for additional information.
New-Employment Drug Test: The Laboratory requires successful applicants to complete a new-employment drug test and maintains a substance abuse policy that includes random drug testing.
Regular position: Term status Laboratory employees applying for regular-status positions are converted to regular status.
Internal Applicants: Please refer to Laboratory policy P701 for applicant eligibility.
Equal Opportunity: Los Alamos National Laboratory is an equal opportunity employer and supports a diverse and inclusive workforce. All employment practices are based on qualification and merit, without regard to race, color, national origin, ancestry, religion, age, sex, gender identity, sexual orientation or preference, marital status or spousal affiliation, physical or mental disability, medical conditions, pregnancy, status as a protected veteran, genetic information, or citizenship within the limits imposed by federal laws and regulations. The Laboratory is also committed to making our workplace accessible to individuals with disabilities and will provide reasonable accommodations, upon request, for individuals to participate in the application and hiring process. To request such an accommodation, please send an email to applyhelp@lanl.gov or call 1-505-665-4444 option 1.
Employment Status: Regular
Appointment Type: Regular
Contact Details
Contact Name: Doyle, Christine Louise
Email: cdoyle@lanl.gov