Our client has been in the top 3 financial products to use in the USA and is trusted by more than 80,000 clients. They are looking for a Senior Site Reliability Engineer for their team, so we from CADABRA are helping them to find the perfect match.
Develop software systems and automated solutions for operational aspects in an organization. Site Reliability Engineer responsibilities include monitoring computer systems and building alerts for various operational issues that computer systems can experience. Ultimately, you will work with our Engineering team to ensure our organization can continue to deliver products and services in our computer system environment.
They need their new teammate to:
- Monitor availability, latency, and overall system health
- Focus on automation and data mining opportunities for improved observability, maintainability, and security
- Create dashboards communicating the health of systems
- Bring up additional serving capacity
- Use the monitoring systems (for alerting and dashboards)
- Understand utilization trends and make scaling and decommissioning recommendations.
- Manage the production incident response process with a focus on minimizing MTTR
- Involved in the deployment process, helping to manage the risk involved with the introduction of change to production environments
- Contribute to capacity planning, demand forecasting, software performance analysis, and systems tuning
- Helps establish and monitor service level objectives for processes and systems
- Improve and keep documentation updated
- Takes proactive steps to address issues before they become problems
They need their new teammate to have strong experience:
- In information technology in the areas of development, quality assurance, and/or operations.
- Implementing and using system monitoring tools
- With continuous integration, deployment, and release management processes and tools
- Creative problem-solving skills with positive, action-oriented attitude and independent thinking
- Creating Physical and Logical design documents
- Work well in a distributed, fast-paced, and dynamic team environment
Requirements:
- 5 plus years of IT experience in development, quality assurance, and/or operations
- Experience with monitoring/alerting microservice architecture systems with Akka and OpenTelemetry
- Experience monitoring/alerting JVM applications using JMX
- Knowledge of enterprise Java frameworks
- Bachelor’s degree in computer science or related field preferred
- Demonstrated ability to solve problems and think beyond the path of least resistance, creating and realizing a vision for providing actionable insight into the health of our solutions
- Strong communication skills
As part of the team you will have:
- Competitive salary and performance-based bonuses
- Comprehensive health plan
- Convenient office location
- Flexible work schedule and work from home options
- 25 days paid annual leave
- Stocked kitchen and weekly lunches
- Sports card
Looking forward to talking with you! Let's do it at simona@cadabra.bg
(No. 2709/ 17.01.2019)