About the Role
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Swift's services, both internally critical and externally visible, have reliability and uptime appropriate to users' needs and a fast rate of improvement. SREs also keep close watch over system capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating manual work through automation.

As a Senior Site Reliability Engineering Manager, you will lead a team responsible for providing the platform that lets mission-critical systems maintain constant uptime, scale seamlessly, and enable new applications and services to flourish. The successful candidate will be highly self-motivated with a passion for excellence, quality, and detail. The SRE Manager will support operations, collaborate with developers and architects to design systems, and assist in implementation to improve stability, security, and scalability.
What to Expect?
Team Building and Mentorship:
- Recruit and retain engineers with diverse perspectives.
- Provide coaching, mentorship, and career development support to ensure team members excel both technically and personally.
Collaboration and Alignment:
- Partner with Product Owners and Engineering Leads to align SRE members with cross-functional squads.
- Foster effective collaboration across teams and functions.
Technical Leadership:
- Guide software design patterns, architecture, and engineering best practices.
- Drive design-focused software delivery to enhance quality and scalability.
Continuous Learning and Knowledge Sharing:
- Promote a culture of learning, knowledge sharing, and excellence across the organization.
- Encourage adoption of consistent practices across teams.
Driving Innovation:
- Stay updated on technology trends and facilitate the adoption of new tools and methodologies.
- Encourage innovative thinking to drive team and organizational growth.
Performance Management:
- Set annual objectives for team members.
- Conduct performance appraisals and provide constructive feedback to support career development.
What Will Make You Successful?
Professional Skills
- Bachelor's or higher degree in Computer Science, Engineering, or related disciplines.
- Strong communication and leadership skills, promoting a diverse and collaborative culture.
- Passion for people development and commitment to creating an inclusive work environment.
- Customer-oriented and quality-focused mindset with a drive to deliver true customer value.
- Open-minded, solutions-oriented team player energized by collaboration.
- Familiarity with Agile and DevOps practices.
- Fluency in English (spoken and written).
- Experience in observability and/or anomaly detection is a plus.
Key Qualifications
- 8+ years of experience in software development using one or more programming languages.
- Expertise in designing, analyzing, and troubleshooting distributed systems.
- 5+ years of leadership experience managing and mentoring technical teams.
- Skilled in cross-functional collaboration to achieve project success.
- Strong passion for automation and reducing manual workloads.
- Proven ability to encourage a culture of visibility and transparency across teams.
- Experience managing enterprise services in large-scale Linux environments.
- Expertise with Kubernetes and configuration management tools like Puppet, Chef, or Ansible.
- Proficiency in troubleshooting issues across the entire software stack.
- Hands-on experience operating large-scale multi-tenant infrastructure as a managed service.
- Strong verbal and written communication skills.
Additional Requirements
- Advocacy for automation to minimize operational workloads.
- Strong sense of ownership, coupled with a collaborative and transparent communication style.
- Self-motivated and inquisitive, always eager to learn and improve systems and processes.
About the Team
On the SRE team, you’ll tackle the complex challenges of scale unique to Swift, leveraging your expertise in coding, algorithms, complexity analysis, and large-scale system design. Our culture values diversity, intellectual curiosity, problem-solving, and openness. We encourage collaboration, big thinking, and risk-taking in a supportive, blame-free environment. SRE promotes self-direction to work on meaningful projects while fostering a learning environment that provides the mentorship needed to grow and succeed.
What we offer
We give you the freedom to be yourself. We are creating an environment of unique individuals – like you – with different perspectives on the financial industry and the world. An environment in which everyone’s voice counts and where you can reach your full potential regardless of age, background, culture, colour, disability, gender, nationality, race, religion, or veteran/military status.