Site Reliability Engineer
Location: Irving, TX or Charlotte, NC (Hybrid Onsite)
Role Summary
We are seeking a highly skilled Application Operational Support / Site Reliability Engineer to support and operate mission-critical enterprise applications in a highly regulated environment. This role is responsible for ensuring platform reliability, availability, and operational excellence through strong CI/CD practices, observability, incident management, and customer-facing remediation.
The ideal candidate combines strong technical troubleshooting skills with disciplined operational practices and the ability to work independently with stakeholders
Key Responsibilities
- Support production and pre-production environments to ensure high availability, performance, and stability of enterprise applications.
- Support and maintain CI/CD pipelines using tools such as GitHub Actions, Harness, or similar.
- Partner with engineering teams to improve deployment reliability, reduce manual steps, and enable repeatable releases.
- Assist with deployment automation and release coordination across environments.
- Execute Incident, Change, and Problem Management processes using ServiceNow.
- Lead or contribute to major incident calls, ensuring clear communication, coordination, and resolution.
- Perform root cause analysis and drive permanent fixes through problem management practices.
- Monitor application and platform health using tools such as Splunk, Grafana, AppDynamics, or equivalent.
- Configure dashboards, alerts, and monitoring thresholds to proactively identify issues.
- Use telemetry data to identify performance bottlenecks and reliability risks.
- Partner with application, infrastructure, and security teams to resolve complex cross-functional issues.
- Identify operational gaps and recommend improvements to tooling, processes, and automation.
- Contribute to runbooks, operational documentation, and standard operating procedures.
- Support platform modernization initiatives aligned with reliability and scalability goals.
Required Skills & Experience
Core Skills
- 5+ years of experience in application/platform operations, production support, or SRE roles.
- 3+ years of experience with CI/CD pipelines (GitHub Actions, Harness, or similar tools).
- Solid understanding of Incident, Change, and Problem Management processes, preferably using ServiceNow.
- 2+ years of experience with observability and monitoring tools such as Splunk, Grafana, AppDynamics, or equivalent.
- Excellent troubleshooting and critical thinking skills, with the ability to diagnose complex production issues.
- Proven experience interacting directly with customers or business stakeholders during operational events.
Technical Competencies
- Strong understanding of application deployment, runtime environments, and system dependencies.
- Ability to read logs, metrics, and traces to identify root causes.
- Familiarity with cloud-native or hybrid enterprise environments.
Nice-to-Have Skills
- Experience with VM image creation/build processes.
- Exposure to OpenShift / OCP or Kubernetes-based platforms.
- Experience operating in regulated environments (banking, financial services).
Recommended Jobs
Area Operations Director- Central District
Description This position provides oversight of the following store locations: Mountain Island, Second Editions: Goodwill Opportunity Campus, Second Editions: Rock Hill, Shopton, South Blvd, Steel…
Business and Marketing Strategy Intern
WHAT MAKES US EPIC? At the core of Epic’s success are talented, passionate people. Epic prides itself on creating a collaborative, welcoming, and creative environment. Whether it’s building award-w…
Heidelberg Printing Press Operator- 1st Shift
Heidelberg Press Operator- 1st Shift **REQUIRES PRIOR EXPERIENCE PROGRAMMING AND RUNNING LITHO/OFFSET SHEETFED PRINTING PRESSES** Summary: Operate sheetfed litho/offset printing presses - specif…
Registered Nurse - Behavioral Health
Job Details Description Do you have a heart for community care? At Easterseals PORT Health (ESPH) , our mission is rooted in empowering individuals and strengthening communities. We’re seekin…
Mobile Diesel Mechanic
Overview: Are you ready to wrench with a mechanically savvy team with leadership that cares and wants you to succeed? Bring your tools to the fastest growing fleet maintenance network in the country.…
Field Appraiser - McDowell County
Vision Government Solutions is looking for North Carolina-based Field Appraisers to join our Reassessment team. Vision performs reassessment services on behalf of local governments throughout the U.S.…
Office Manager
Accentuate Staffing is currently recruiting for an Office Manager for an established company in Raleigh. The Office Manager is responsible for the overall daily operations of the building, overseeing …
Demand Planner
Key Responsibilities Develop, maintain, and continuously improve demand forecasts based on customer orders, sales inputs, historical data, and market trends. Translate demand forecasts into p…
European Automotive Technician
About Us : At Eurotechnik, we’re not just in the business of cars—we’re in the business of changing the face of the automotive repair industry. Our goal is to make the exceptional the norm. We take …