Back to Search Results
Get alerts for jobs like this Get jobs like this tweeted to you
Company: SS&C Technologies
Location: London, United Kingdom
Career Level: Mid-Senior Level
Industries: Technology, Software, IT, Electronics

Description

As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.

Job Description

Be part of a global team that ensures the performance, scalability, and reliability of critical cloud-based applications. As part of the Global Investor and Distribution Solutions (GIDS) Platform Services team, you'll play a key role in keeping our systems running smoothly and efficiently while helping shape the future of our platform. 

 

Key Responsibilities:  

  • Collaborate with global teams as part of a follow-the-sun support model. 

  • Respond to, troubleshoot, and resolve Level 2 application incidents. 

  • Ensure critical applications are effectively monitored using tools like Prometheus and Grafana. 

  • Create and maintain dashboards and alerts to enhance visibility into application health. 

  • Define, implement, and track key SRE metrics (SLOs, SLIs, error budgets). 

  • Partner with development teams to improve application reliability and resilience. 

  • Analyse incident trends and recommend improvements to reduce recurrence. 

  • Automate repetitive support tasks to improve efficiency. 

  • Participate in post-incident reviews and drive reliability initiatives. 

  • Perform infrastructure and application patching as part of regular maintenance cycles. 

  • Support security vulnerability remediation efforts across both infrastructure and application layers. 

 

 

Qualifications: 

Minimum Qualification 

  • Bachelor's degree in Computer Science, Computer Engineering, IT, or related field. 

  • At least 3+ years of experience in a similar role.

  • Proficiency in one or more programming languages, preferably Java, JavaScript or Python.  

  • Proven ability to troubleshoot complex systems

  • Skilled in debugging, code optimisation, and automation

  • Experience with relational databases and data analysis. 

 

 

Highly Preferred 

  • Experience working in Site Reliability Engineer (SRE) roles or incident response environments. 

  • Hands-on experience with cloud infrastructure, preferably AWS. 

  • Familiarity with observability tools such as Grafana, ELK Stack, or similar. 

  • Experience deploying and managing applications on Kubernetes platforms. 

  • Strong skills in analysing and troubleshooting issues in large-scale, distributed systems. 

  • Familiarity with PostgreSQL and its performance tuning, monitoring, and troubleshooting. 

We encourage applications from people of all backgrounds and particularly welcome applications from under-represented groups, to enable us to bring a diversity of perspectives to our thinking and conversation. It's important to us that we strive to have a workforce that is diverse in the widest sense.

Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.

SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws.


 Apply on company website