Senior Site Reliability Engineer Job at Broad Reach Partners, Alpharetta, GA

  • Broad Reach Partners
  • Alpharetta, GA

Job Description

Location: Hybrid (Alpharetta, GA – 3 days/week in office)
Type: Full-Time

We are seeking a Senior Site Reliability Engineer to join our team and play a key role in enhancing the stability, performance, and reliability of our production systems. You’ll work closely with development, DevOps, and security teams to improve observability, optimize system performance, and ensure production readiness. From monitoring to automation, you’ll make a direct impact on our cloud infrastructure and service reliability.

In this role, you will work hand-in-hand with our development, operations, and security teams worldwide to implement best practices, automate deployments, and ensure our platforms are reliable, secure, and scalable. Troubleshooting in Kubernetes requires a deep understanding of pods, nodes, networking, scaling, logs, and service-to-service communication.

This role requires a deep understanding of SRE best practices and a strong ability to troubleshoot complex issues.

Your responsibilities in this role will include:

  • Identify opportunities for automation and ensure continuous security and quality in application development by automating security checks and test execution in build and deployment pipelines.
  • Deploy and manage Kubernetes workloads on AWS EKS(A) using Helm and ArgoCD.
  • Work with Kubernetes and ensure that applications and clusters stay reliable, performant, scalable, and observable.
  • Collaborate with development, operations, and security teams to build secure, optimized, and efficient pipelines.
  • Create comprehensive documentation on pipeline functionality and provide training to team members who need it.
  • Proactively monitor system performance and identify potential issues before they become critical.
  • Participate in on-call rotation. Troubleshoot production issues and perform root cause analysis.
  • Engage in continuous learning and actively advocate for Dev(Sec)Ops, GitOps best practices and standards across the team.

We are looking for you to have the following skills and experience:

  • 8+ years of experience as a Site Reliability Engineer, or equivalent
  • Experience with tools like New Relic for monitoring and Graylog for logging.
  • 3+ years of experience with Amazon Web Services (AWS) or Microsoft Azure
  • 3+ years of experience with Kubernetes clusters, including performance monitoring in Kubernetes.
  • Proficiency with public cloud environments (AWS preferred)
  • Proficiency in a scripting language such as Bash, Groovy, or Python
  • Excellent debugging and troubleshooting skills.
  • Ability to prioritize tasks efficiently and independently under minimal supervision.

Nice to Have

  • AWS Cloud certification
  • Familiarity with .NET applications
  • Knowledge of Terraform, Ansible, and monitoring tools

This is a full-time role and we are unable to sponsor visas, so you must be a U.S. citizen or a Green Card holder. We work onsite a few days each week in our Alpharetta office, so you must live in the Atlanta area, within commuting distance of the office. If you thrive on solving complex technical challenges, have a passion for automation, and want to influence how enterprise platforms evolve and modernize, this is an ideal opportunity for you.

Ready to take the next step in your SRE career? Apply now and help us build the future of reliable systems!

Job Tags

Remote job, Full time, Live in, Work at office, Worldwide, 3 days per week
