Sr. Infrastructure/SRE Engineer

Montevideo, Uruguay

Full Time

Technology

Mid Level

**We welcome all people based in the Montevideo Metropolitan Area to apply. This job is a hybrid role from this location. **

The Sr. Infrastructure/SRE Engineer will collaborate with organizational leads and cross-functional teams to ensure our infrastructure, automation, and reliability practices align with business priorities. Will lead reliability-focused infrastructure initiatives, implement observability best practices, and drive operational excellence. Keen attention to detail, strong problem-solving abilities, and deep expertise in cloud systems are essential. This role will focus on building resilient infrastructure, implementing configuration management, enhancing CI/CD pipelines, and improving system performance, scalability, and availability to meet SLOs. A Sr. Infrastructure/SRE Engineer is also responsible for incident response and on-call management, driving root cause analysis, facilitating blameless postmortems, and implementing remediation plans to prevent recurrence. Will work to continuously improve monitoring, alerting, and automated recovery mechanisms to minimize downtime and ensure high service reliability.

Things You'll Do:

Ensure high reliability and uptime of production systems through proactive monitoring, incident response, and capacity planning.
Develop and maintain automated solutions for configuration management, deployment, monitoring, and alerting/self-healing.
Participate in on-call rotations, lead incident response efforts, and drive root cause analysis to prevent recurrence.
Define, measure, and track SLIs, SLOs, and SLAs, ensuring alignment with business and reliability goals.
Collaborate with application and infrastructure teams to design resilient, scalable, and secure architectures.
Adopt and leverage AI-powered solutions to optimize observability, anomaly detection, automated remediation, and operational forecasting. Implement and refine AI-assisted automation workflows to streamline incident management and reduce human intervention in repetitive tasks.
Continuously improve system performance, scalability, cost efficiency, and observability across production and pre-production environments.
Work closely with developers to integrate SRE and security practices into CI/CD pipelines and development workflows.
Lead and contribute to blameless postmortems and implement action plans to strengthen future resilience.
Document runbooks, operational workflows, and architectural decisions to ensure knowledge sharing and operational consistency.
Drive a culture of reliability engineering, automation, and AI adoption to enhance operational excellence and accelerate business innovation.

Things You'll Bring:

Strong expertise in Linux systems, networking, distributed architectures, and AWS cloud platforms.
Hands-on experience with Infrastructure as Code (IaC) tools such as Terraform, AWS CDK, and configuration management (Ansible, SaltStack).
Proven ability to build and maintain CI/CD pipelines and automate deployment workflows.
Deep knowledge of monitoring, observability, and alerting tools (Datadog, Prometheus, ELK, Grafana) with experience implementing self-healing systems.
Experience defining and tracking SLIs, SLOs, and SLAs, using data-driven insights to guide operational decisions.
Proficiency in incident response, root cause analysis, and blameless postmortems. Expertise in capacity planning, cost optimization, and performance tuning for largescale systems.
Familiarity with AI-driven operational tools for anomaly detection, predictive scaling, and intelligent alerting.
Experience integrating AI-assisted runbooks and automated remediation workflows to reduce MTTR.
Strong understanding of cloud-native architecture patterns, container orchestration (e.g., Kubernetes, EKS), and service meshes.
Ability to collaborate with developers to embed reliability, observability, and security best practices throughout the SDLC.
Excellent analytical and problem-solving skills, capable of diagnosing complex distributed system issues.
Effective communication and mentoring skills, fostering a culture of continuous learning and operational excellence.
Proficiency in documenting architecture, operational processes, runbooks, and AI/automation workflows.

Years of Work Experience: 5 - 7 years of experience
Education/Skills & Capabilities: Bachelor's Degree (4-year) : Information Technology, computer science, engineering, or relevant field preferred

We are a team built on purpose, not perfection.
The game is changing, and we're writing the new playbook. Our goals are ambitious, and we know that building the future requires diverse perspectives and skills. If you're excited about this role, but your experience doesn't align perfectly with every qualification, we still encourage you to apply.
We're looking for people who are accountable, customer-centric, and innovative. We believe that talent thrives when we empower leaders to grow and evolve. So, apply anyway. You might be just the right person for this role or another opportunity on our team

Benefits:
In addition to the above-mentioned salary, you will be entitled to all employment benefits established under Uruguayan labor law, including (but not limited to) 20 working days of annual paid vacation, vacation bonus and a thirteenth salary.

Perceptyx In The News 📰

Perceptyx Equal Employment Opportunity Policy:
Perceptyx celebrates diversity and an inclusive environment. We focus on providing an environment of mutual respect where equal employment opportunities are available to all employees and applicants for employment. We prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.
Perceptyx’s policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training. All aspects of employment are decided on the basis of qualifications, knowledge, merit, and business needs.

Apply for this position

Required*

Apply with Indeed

First Name*

Last Name*

Email Address*

Phone*

Address

Resume*

We've received your resume. Click here to update it.

Attach resume or Paste resume

Attach resume as .pdf, .doc, .docx, .odt, .txt, or .rtf (limit 5MB) or Paste resume

Paste your resume here or Attach resume file

Are you legally authorized to work in Uruguay without requiring visa assistance?*

Do you currently reside in Montevideo Metropolitan Area?*

In this interview process, you must be able to answer all questions independently without the use of AI. Do you agree not to use any AI assistance, including large language models, to generate or refine your responses during our interview process?*

This role primarily entails sedentary work within a home office or office setting, including extensive computer use. The employee will frequently sit, stand, and walk, as well as engage in video/verbal communication and auditory perception. Tasks may involve manual dexterity for handling objects, tools, or controls, and repetitive hand or wrist movements with a keyboard and mouse.
- Do you acknowledge and accept the physical requirements of this position?*

Have you built and operated production AWS infrastructure using Terraform (or AWS CDK) and managed Kubernetes/EKS networking (ingress, load balancers, service-to-service comms)?*

Have you implemented AI-assisted operational workflows (e.g., anomaly detection, automated remediation) and/or used MCP to expose least-privilege runbooks in production?*

Are you willing to participate in a 24/7 on-call rotation and have you done so within the last 12 months?*

Describe a recent production incident you led end-to-end. What were the key SLIs/SLOs, root cause, tooling (e.g., Datadog/Prometheus/ELK), and the automation/remediation you implemented? Include before/after MTTR or error rate.*

Share how you designed a multi-account AWS setup (VPC/TGW/EKS) with Terraform/Ansible. How did you structure modules/state, enforce policy-as-code in CI/CD, and validate changes (tests, canary, rollback)?*

Outline an AI-assisted incident workflow that detects anomalies and, via MCP, invokes least-privilege runbooks (Terraform/Ansible/AWS APIs). What guardrails (RBAC, approvals, dry-runs, rate limits) and success metrics (precision, latency, MTTR, cost) would you use?*

Have you ever worked or currently work at Perceptyx? If YES, please provide your job title, dates of employment, and manager name. If NO, please type "NA".

Has anyone encouraged you to apply for this position? If YES, please let us know who. If NO, please type "NA".

What are your salary expectations for this position?*

Please provide your LinkedIn profile link below.*

What sparked your interest in applying to this position?*

The following questions are entirely optional.

To comply with government Equal Employment Opportunity and/or Affirmative Action reporting regulations, we are requesting (but NOT requiring) that you enter this personal data. This information will not be used in connection with any employment decisions, and will be used solely as permitted by state and federal law. Your voluntary cooperation would be appreciated. Learn more.

Gender

Race/Ethnicity

Voluntary Self-Identification of Disability

Voluntary Self-Identification of Disability Form CC-305
OMB Control Number 1250-0005
Expires 04/30/2026

Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

Alcohol or other substance use disorder (not currently using drugs illegally)
Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
Blind or low vision
Cancer (past or present)
Cardiovascular or heart disease
Celiac disease
Cerebral palsy
Deaf or serious difficulty hearing
Diabetes
Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
Epilepsy or other seizure disorder
Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
Intellectual or developmental disability
Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
Missing limbs or partially missing limbs
Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
Partial or complete paralysis (any cause)
Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
Short stature (dwarfism)
Traumatic brain injury

Please check one of the boxes below:

YES, I HAVE A DISABILITY, OR HAVE HAD ONE IN THE PAST NO, I DO NOT HAVE A DISABILITY AND HAVE NOT HAD ONE IN THE PAST I DO NOT WANT TO ANSWER

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.

You must enter your name and date
Name	Date

Human Check*

Submit Application

Thanks for visiting our Career Page.

Sr. Infrastructure/SRE Engineer

Apply for this position