Description
SS&C is a global provider of investment and financial services and software for the financial services and healthcare industries. Named to the Fortune 1000 list as a top U.S. company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 20,000+ employees in over 90 offices in 35 countries. Some 18,000 financial services and healthcare organizations, from the world's largest institutions to local firms, manage and account for their investments using SS&C's products and services.
Job Description
About the Team:
The Intralinks SRE team is considered the guardian of the production application platform. The team's main goal is to ensure uninterrupted service to Intralinks' clients. You will be part of a global team that consistently looks for ways to improve the monitoring and availability of the platform.
Overview:
In this role, you will help R&D with root cause analysis, work on operational tasks, and automate monitoring and alerting to minimize MTTD and MTTR.
Day to Day:
- Responding to and resolving escalated incidents arising from customer issues or monitoring alerts
- Performing in-depth analysis of incident root causes
- Working with R&D and architecture teams on defects and runtime inefficiencies identified in the production environment
- Building diagnostic and analytical tools that improve MTTA, MTTD, and MTTR
- Building system and site monitoring tools for system health and APIs to ensure smooth operation of production systems
- Configuring and integrating commercially available monitoring tools into the production systems
- Validating and verifying software deliverables for production readiness
- Assessing and mitigating the risk of changes to the production systems
Minimum Experience:
- Strong work experience in Unix/Linux
- Strong knowledge of Java web-based enterprise applications
- Strong work experience and troubleshooting skills in Kubernetes systems
- Strong work experience with microservices using Kubernetes and containerized applications
- Working experience with AWS, including CloudWatch, EKS, EFS, S3, Redshift, and other AWS services
- Working experience in using AWS tools to troubleshoot applications (resource constraints, connectivity, alerting, and monitoring)
- Sound knowledge of one of the major programming languages (such as Java) and of performance tuning
- Ability to automate mundane tasks using shell scripts, Python, etc.
- Working knowledge of basic networking and various application and transport protocols:
  - Application: HTTP(S), JMS, etc.
  - Transport: TCP, UDP
- Experience working with one or more of the following: Splunk, Dynatrace, Zabbix, Prometheus, etc.
- Experience working with one or more of the following: Oracle, PostgreSQL, MongoDB
- Experience working with messaging subsystems: RabbitMQ, Interconnect, AMQ.
- Working experience with Jenkins and Git
Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.
SS&C offers excellent benefits, including health, dental, a 401(k) plan, and a tuition and professional development reimbursement plan.
SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status, or any other classification protected by applicable discrimination laws.