NOV

Site Reliability Engineer

Houston, TX

Full-time

About This Job

As a Site Reliability Engineer, you will be responsible for: Operational Excellence & Incident Management

•

Maintain and monitor production systems for availability, latency, and performance.

•

Lead incident response efforts, including communication, resolution, and postmortem documentation.

•

Design and implement health checks, alerting systems, and automated remediation workflows.

•

Drive root cause analysis and implement permanent resolutions for recurring issues.

Observability & Insights

•

Set up and maintain full observability stacks (logging, metrics, tracing) using tools like Prometheus, Grafana, Datadog, OpenTelemetry, or ELK.

•

Analyze telemetry and logs to identify trends, anomalies, and opportunities for improvement.

•

Conduct post-incident reviews and use insights to inform future engineering investments.

Performance & Systems Optimization

•

Tune and optimize distributed systems, including AKKA.NET actors, for performance and resource efficiency.

•

Work with developers to evolve architecture and improve system throughput, latency, and stability.

•

Optimize PostgreSQL performance, queries, and maintenance strategies.

CI/CD & Automation

•

Design and maintain modern CI/CD pipelines using GitHub Actions, Azure Pipelines, or GitLab CI.

•

Automate deployment, testing, and rollback processes to reduce friction and increase deployment frequency.

•

Standardize infrastructure as code practices across environments.

Education and Experience

•

5+ years of experience in SRE, DevOps, or Infrastructure Engineering roles.

•

Bachelor’s degree in information technology, Computer Science, or a related

•

Expertise in Kubernetes and container orchestration at scale.

•

Strong experience with AKKA.NET or similar actor-based frameworks.

•

Proficiency with scripting and automation (Bash, PowerShell, Python).

•

Experience with observability tools (Phobos,Datadog, Prometheus, Grafana, OpenTelemetry, ELK).

•

Hands-on experience with cloud platforms (AWS, Azure, or GCP).

•

Strong PostgreSQL knowledge—performance tuning, query optimization, maintenance.

•

Proven ability to lead incident management and drive postmortem processes.

•

A builder’s mindset with high standards for operational excellence and technical ownership.

Preferred Tools & Ecosystem Experience

•

CI/CD: GitHub Actions, Azure Pipelines, GitLab CI

•

Infrastructure: Kubernetes, Docker, Terraform

•

Monitoring: Phobos (AKKA.NET), Datadog, Prometheus

•

Source Control: GitHub, GitLab, Azure DevOps

•

Programming: C#, Python, Bash, PowerShell

Similar Jobs

Electrical Engineer

Dudley Staffing

Site Reliability Engineer

Site reliability Engineer

Reliability Engineer

OceanaGold Corporation

Reliability Engineer

Reliability Engineer

The Mosaic Company

Bradley Junction, FL

Reliability Engineer

Reliability Engineer

Century Aluminum

Reliability Engineer

Arkansas Electric Cooperative Corporation

Reliability Engineer

$66300 - $187500

Reliability Engineer

Fort McMurray, AB

Reliability Engineer

Fort McMurray, AB

Reliability Engineer

JANA Corporation

Site Reliability Engineering

Reliability Engineer

Sherritt International Corporation

Fort Saskatchewan, AB

Senior Engineer - Site Reliability Engineering

Reliability Engineer I

Freeport-McMoRan

$76000 - $104000

Maintenance Reliability Engineer

$74497 - $83121

Maintenance Reliability Engineer

Vallourec - North America

Build Reliability Engineer

Stealth Startup

Trending Jobs

Electrical Engineer

Dudley Staffing

Division Order Analyst

Coronado Resources

about 2 months ago

Professional Landman

Penterra Services, LLC

Accounts Payable Clerk

$65000 - $65000

Division Order Landman

R. Lacy Services, Ltd.

about 1 month ago

contract landman

HPS Oil & Gas Properties

Oil and Gas Land and Title Analyst - SAM Associate II

Bank of America

Attorney

Toeppich & Associates

over 1 year ago

Title Landman

Sustain Land Services

Senior Landman

Greenlake Energy

Electrical Designer

Dudley Staffing

Title Reviewer

Innovation Land Services

Landman

Stockyards Energy Land Services

Oil and Gas Title Attorney

Oliva Gibbs PLLC

Civil/Structural Designer

Dudley Staffing

contract Landman

HPS Oil & Gas Properties

contract Landman

HPS Oil & Gas Properties

Senior Division Order Analyst

$110000 - $130000

about 1 year ago

Mechanical/Piping Engineer

Dudley Staffing

E & I - Office/Field Administration

Surepoint Group

Grande Prairie, AB

Notice: The inclusion of job postings or company information on our platform does not imply endorsement, partnership, or affiliation. Listings may include publicly available roles from various sources, and companies shown may not have a direct relationship with Energy Hire.