Incident management (advanced)
Team Management (advanced)
Egnyte is a product-focused company. We build and scale our flagship product : a secure content platform called Egnyte used by companies like Red Bull, IKEA, and Yamaha.
It’s a large-scale system with 16,000+ customers and over 6000 instances across geo-distributed Data Centers and Cloud providers.
Our customers can access and manage their data through different devices and interfaces like mobile, desktop apps, or WebUI.
For more information about the product, visit our website : www.egnyte.com.
The opportunity :
We have over 230 employees in Poland and are still increasing the size of our team driven by growing business demands and opportunities.
To support this goal, we’re looking for an SRE Manager for the Observability Team with experience in Data Center Operations, Service Reliability, and Engineering awareness in a SaaS environment.
We are looking for an experienced leader who will be working directly with the engineers, hand in hand with other members of the Operations and Engineering leadership team.
A person who makes sure engineers are working on the right things and enabling their growth.
As an SRE Manager, you will be reporting to the Director of SRE. At the same time, you will enjoy full autonomy regarding the team you manage, and focus on service reliability in accordance with target SLOs and SLAs.
Your day-to-day at Egnyte :
Egnyte is a remote-friendly organization, so our employees have the possibility to work from their home offices or Polish HQ located next to the scenic Lake Malta.
Here’s what you will do as an SRE Manager, Observability at Egnyte :
Participate in daily status meetings with the team working on-site and remotely
Effective team building :
direct involvement in the recruitment process
building long term team competency in the SRE Observation team
setting up career development plans for team members
providing business context regarding the items the team needs to deliver
lead the existing SRE Observability team
Provide service reliability
Promote good practices in service management and manage technical operations in a 24 / 7 model
Manage troubleshooting skills in the team to hunt down the root causes of issues and persistence in preventing them from happening again
Ensure all important data is collected and managed
Stimulate dialogue between Operations and Engineering teams :
to ensure the requirements are well defined and known
to ensure a variety of implementations are properly prioritized
Be an active member of the Operations Team who participates in operations and engineering-wide processes improvement initiatives
Promote good practices in service management
Drive innovation by encouraging and supporting your team in looking for smart solutions, also influencing already existing IT processes to improve the work environment
Improve system stability by better visibility and reporting
About you :
2+ years of experience in an operations / monitoring / service leadership role, ideally in SaaS company
Experience in managing and building a geographically dispersed team
Good judgment when to step in and when to step back to give your team space to act
Experience within Incident and Request Management, including SWAT calls
Strong leadership, ownership and facilitation skills
Advanced communication skills
Solid English in speech and writing
Bonus points :
Understanding of cloud infrastructure and DevOps tooling
Previous experience in a technical role