Site Reliability Engineer (SRE) Job at TMS, San Jose, CA

ZURMMVp5dlRSVjFaTFVINjdwbFNoV09ESUE9PQ==
  • TMS
  • San Jose, CA

Job Description

Job Title: Site Reliability Engineer (SRE) AWS & DevOps
Location: San Jose, CA (Local Only)
Duration: 12+ Months
Experience Needed: 12+ Years


Job Overview:

We are looking for a skilled Site Reliability Engineer (SRE) with strong expertise in AWS Cloud and DevOps practices to join our growing engineering team. This role is critical in ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. You will focus on automation, observability, infrastructure as code, and operational excellence, working closely with development and operations teams to deliver a seamless production environment.

Key Responsibilities:

  • Design, implement, and manage highly available and scalable infrastructure on AWS .
  • Develop and maintain Infrastructure as Code using tools such as Terraform , CloudFormation , or AWS CDK .
  • Build, manage, and optimize CI/CD pipelines for fast and reliable application delivery using tools like Jenkins , GitLab CI/CD , CircleCI , or AWS CodePipeline .
  • Monitor system performance and availability using CloudWatch , Prometheus , Grafana , ELK , or Datadog .
  • Define and implement observability , logging , and alerting to ensure system health and quick incident resolution.
  • Improve system reliability through monitoring, incident response, post-mortems, and automation.
  • Support containerized applications using Docker , ECS , EKS , or Kubernetes .
  • Collaborate with development teams to ensure operational readiness and incorporate SRE principles into the software development lifecycle.
  • Implement security best practices across AWS resources, including IAM , encryption , VPC configuration , and compliance monitoring .
  • Participate in an on-call rotation and lead incident response and root cause analysis.

Required Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or related technical field (or equivalent experience).
  • 3+ years of experience in a Site Reliability Engineering, DevOps, or Cloud Engineering role.
  • Strong hands-on experience with AWS services including EC2, S3, RDS, Lambda, IAM, VPC, CloudFront, etc.
  • Proficient in scripting/programming (e.g., Python, Bash, Go).
  • Expertise in CI/CD tools and automation frameworks .
  • Experience with monitoring and observability tools .
  • Strong understanding of networking , DNS , firewalls , and load balancing .
  • Solid grasp of DevOps methodologies and agile practices .
  • Familiarity with containerization and orchestration tools.

Preferred Qualifications:

  • AWS Certifications (e.g., AWS Certified DevOps Engineer , Solutions Architect ).
  • Experience with Kubernetes , Helm , or Service Mesh .
  • Background in security and compliance (SOC2, ISO 27001, etc.).
  • Exposure to chaos engineering , SRE principles , and error budgets .
  • Experience with configuration management tools like Ansible , Chef , or Puppet .

Job Tags

Local area,

Similar Jobs

Prisma Health

Surgical Technologist Extern, Neurosurgery OR Richland, PRN Job at Prisma Health

 ...maintaining the cleanliness of the work environment. Assists in cleaning the OR for the next procedure including disposal of...  ...Location 5 Medical Park Rd Richland Facility 1510 Richland Hospital Department 15106160 Operating Room Share your talent... 

New York University

Research Assistant in the Division of Science [Biology] - Dr. David A. Scicchitano | New York University Job at New York University

 ...The laboratory of David A. Scicchitano in the Division of Science at New York University Abu Dhabi seeks to recruit a Research Assistant to work on the mechanisms underlying genome maintenance, particularly in response to agents that damage DNA. DNA damage within... 

CentiMark Corporation

Construction Laborers Job at CentiMark Corporation

 ...to lift 50 lbs., ~ Able to climb up and down ladders to minimum heights of 25 feet, ~ Have reliable transportation ~ Able to work Saturday and/or Sunday, if needed ~ Authorized to work in the United States, CentiMark is an Equal Opportunity Employer offering... 

Walt Disney World Resort

Food & Beverage Quick Service Restaurant Part Time - Walt Disney World (Hiring Immediately) Job at Walt Disney World Resort

 ...team who depend on each other to serve our guests the delicious food they expect from Disney, in an efficient manner that allows...  ...and Resorts! Specific types of locations include Quick Service Restaurants, Outdoor Vending locations, or Kiosks. The environment is friendly... 

Audacy, Inc.

Part time Talk and Sports Producer Job at Audacy, Inc.

Overview: If you love sports, WILK Newsradio, Northeast PAs premiere NewsTalk radio station, is looking for an energetic sports fan...  ...classification protected by applicable federal, state, or local law, and to comply with all applicable laws and regulations. Consistent...