
Description
As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.
Job Description
Title: Site Reliability Engineer (SRE), Cloud Incident Response
Location: Melbourne, Australia |Sydney, Australia | Hybrid
Job Description
Get To Know Us:
SS&C GIDS provides information processing and computer software services and products. The Company's operating segments include financial markets, customer management, professional services, and output solutions. SS&C GIDS serves the alternative investments, asset and wealth management, banking and lending, insurance, and real estate industries.
Why You Will Love It Here!
- Flexibility: Hybrid Work Model
- Your Future: Income Protection Insurance & Salary Continuance
- Work/Life Balance: Generous Bereavement & Compassionate leave
- Your Wellbeing: Private Health Insurance discount, Primary & Secondary Paid Parental leave, Death & TPD Insurance
- Diversity & Inclusion: Committed to Welcoming, Celebrating and Thriving on Diversity
- Training: Hands-On, Team-Customized, including SS&C University
- Extra Perks: Discounts on fitness clubs, travel and more!
What You Will Get To Do:
- Collaborate with global teams as part of a follow-the-sun support model.
- Respond to, troubleshoot, and resolve Level 2 application incidents.
- Ensure critical applications are effectively monitored using tools like Prometheus and Grafana.
- Create and maintain dashboards and alerts to enhance visibility into application health.
- Define, implement, and track key SRE metrics (SLOs, SLIs, error budgets).
- Partner with development teams to improve application reliability and resilience.
- Analyze incident trends and recommend improvements to reduce recurrence.
- Automate repetitive support tasks to improve efficiency.
- Participate in post-incident reviews and drive reliability initiatives.
What You Will Bring:
Minimum Qualification
- Bachelor's degree in Computer Science, Computer Engineering, IT, or related field.
- 5+ years of experience for senior roles; fresh graduates welcome for junior roles.
- Proficiency in one or more programming languages, preferably Java, JavaScript or Python.
- Proven ability to troubleshoot complex systems.
- Skilled in debugging, code optimization, and automation.
- Experience with relational databases and data analysis.
Highly Preferred
- Experience working in Site Reliable Engineer (SRE) roles or incident response environments.
- Hands-on experience with cloud infrastructure, preferably AWS.
- Familiarity with observability tools such as Grafana, ELK Stack, or similar.
- Experience deploying and managing applications on Kubernetes platforms.
- Strong skills in analyzing and troubleshooting issues in large-scale, distributed systems.
We encourage applications from people of all backgrounds to enable us to bring diverse perspectives to our thinking and conversation. It's important to us that we strive to have a workforce that is diverse in the widest sense.
Thank you for your interest in SS&C! If applicable, to further explore this opportunity, please apply directly with us through our Careers page on our corporate website @ www.ssctech.com/careers.
Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.
SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws.
Apply on company website