
Senior Site Reliability Engineer

Dallas, TX


Description

WE ARE A TRANSFORMATIONAL PARTNER

We marry design and engineering language in ways that produce impactful and memorable experience journeys. We partner with our clients all the way to continuously improve their digital maturity. Our Studio network brings the optimal combination of skill, scale, and cost for each stage of the product development lifecycle. To do this, we need great, transformational people who want to impact the projects and organizations they work with.

We are looking for a skilled Senior Site Reliability Engineer to join our team.

About the Role

Site reliability engineers (SREs) empower our users to pursue their missions with a rich feature set, high availability, and stellar performance. As we expand our customer deployments, we are seeking an experienced SRE to deliver insights from massive-scale data in real time. Specifically, we are searching for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.

Objectives of this Role

  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Build software and systems to manage platform infrastructure and applications
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
  • Provide primary operational support and engineering for multiple large distributed software applications

Responsibilities

  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
  • Partner with development teams to improve services through rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning
  • Create sustainable systems and services through automation and uplifts
  • Balance feature development speed and reliability with well-defined service level objectives

Required Skills and Qualifications

  • Must Have: Experience with Google Cloud Platform
  • Bachelor’s degree in computer science or other highly technical, scientific discipline
  • Experience with Anthos for consistency and scale
  • Experience with Apigee Hybrid or API Gateway
  • Experience with network planning and security, from data centers to public cloud
  • Ability to program (structured and OO) in one or more high-level languages, such as Python, Java, C/C++, Ruby, or JavaScript
  • Experience with distributed storage technologies such as NFS, HDFS, Ceph, and S3, as well as dynamic resource management frameworks (Rancher, Helm, Kubernetes, Docker)
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks


Cognizant Softvision is an Equal Opportunity Employer. No 3rd Party Agency Candidates.

You must be legally authorized to work in the United States without the need for employer sponsorship, now or at any time in the future.

 
