Cloud Reliability Engineer

Job#: 2009444

Job Description:

Apex Systems is seeking a Cloud Site Reliability Engineer to work onsite in Fayetteville, NC. Candidates must hold an active Secret security clearance to be considered. To apply, please email your resume to Cameron at [email protected]
Job Description/Day to Day:
  • Run the production environment by monitoring availability and taking a holistic view of system health.
  • Build software and systems to manage platform infrastructure and applications.
  • Improve the reliability, quality, and performance of cloud-hosted applications.
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve.
  • Provide primary operational support and engineering for multiple large, distributed software applications.
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.
  • Partner with development teams to improve services through rigorous testing and release procedures.
  • Participate in system design consulting, platform management, and capacity planning.
  • Create sustainable systems and services through automation and uplifts.
  • Balance feature development speed and reliability with well-defined service level objectives.

Education:
  • Bachelor’s Degree in a STEM field.
  • DoD 8570 Level II certification (Security+).

Required Experience: 8+ years of related experience
Required Technical Skills:
  • Ability to program (structured and object-oriented) in one or more high-level languages, such as Python, Java, C/C++, Ruby, or JavaScript.
  • Adept at shell/Bash scripting.
  • Experience with distributed storage technologies like NFS, HDFS, Ceph, and S3.
  • 2+ years of experience working with container orchestration technologies, specifically Kubernetes.
Required Skills and Abilities:
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks, along with the ability to propose and implement solutions to address them.
  • Experience creating dashboards, preferably with Splunk, that track service health and appeal to both technical and non-technical audiences.
  • Excellent written and verbal communication skills, with strong attention to detail and a head for problem solving.
  • Skilled at working as part of a team or unsupervised, as required.
Preferred Skills:
  • Experience working with identity and access management technologies and solutions.
  • Experience with Agile development methodologies and collaboration tools such as Jira and Confluence.
  • Experience with monitoring and logging solutions, specifically Splunk.
  • Any of the following certifications: AWS Certified SysOps Administrator Associate, AWS Certified Solutions Architect Associate, or the Professional level of either, where applicable.
  • 1+ years of experience working with GitLab.
  • Skilled at creating Ansible playbooks and working with AWX/Ansible Tower.

EEO Employer

Apex Systems is an equal opportunity employer. We do not discriminate or allow discrimination on the basis of race, color, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), age, sexual orientation, gender identity, national origin, ancestry, citizenship, genetic information, registered domestic partner status, marital status, disability, status as a crime victim, protected veteran status, political affiliation, union membership, or any other characteristic protected by law. Apex will consider qualified applicants with criminal histories in a manner consistent with the requirements of applicable law. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department at [email protected] or 844-463-6178.

Apex Systems is a world-class IT services company that serves thousands of clients across the globe. When you join Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing® in Talent Satisfaction in the United States and Great Place to Work® in the United Kingdom and Mexico.

Employee Type:
Contract

Location:
Fort Liberty, NC, US

Job Type:
Infrastructure and Security

Date Posted:
January 26, 2024