About Hashgraph:
Hashgraph is a fast-growing software company committed to supporting, developing and servicing Hedera, an open source, proof-of-stake platform. Hedera is EVM-compatible and has been specifically built to meet the needs of enterprise and Web3 applications, which require speed, security, stability and sustainability. Hedera’s public network is governed by industry-leading organizations, spanning 11 sectors and 14 regions who oversee the development and direction of the decentralized platform.
About the role:
Hashgraph is seeking an experienced DevOps Manager to lead our DevOps team in supporting the operations of consensus nodes across Hedera testnet, previewnet, and preproduction environments. This role requires a hands-on technical leader who can balance strategic planning with day-to-day operational excellence in our web3 infrastructure.
As the DevOps Manager, you will lead a team of operations engineers while remaining technically engaged in building automation, improving infrastructure as code, and coordinating with Hedera Governing Council members. You'll be responsible for team development, process optimization, and ensuring 24/7 operational readiness of critical Hedera network infrastructure.
This role requires strong technical expertise in cloud infrastructure (particularly GCP), infrastructure as code tools (Terraform, Ansible), and container orchestration (Kubernetes), combined with proven people management skills to mentor, grow, and retain top engineering talent.
You may find yourself doing all of the following:
- Lead and mentor a team of DevOps engineers, providing technical guidance and career development
- Manage day-to-day operations of Hedera production and preproduction infrastructure
- Coordinate with Hedera Governing Council members on operational matters and infrastructure requirements
- Design and implement automation solutions to reduce operational toil and improve efficiency
- Own and evolve infrastructure as code practices using Terraform and Ansible
- Establish and maintain incident management processes, including on-call rotations and post-mortem reviews
- Drive continuous improvement initiatives for monitoring, observability, and alerting systems
- Manage capacity planning and scaling strategies for cloud and bare metal infrastructure
- Ensure 24/7 operational readiness and lead response to critical incidents
- Lead hiring efforts to grow the DevOps team, including defining role requirements, interviewing candidates, and making hiring decisions
- Collaborate with development teams to improve CI/CD pipelines and deployment processes
- Define and track team KPIs, SLOs, and operational metrics
- Manage team budget and resource allocation
- Interface with senior leadership on strategic planning and technical roadmap
Qualification Requirements:
- B.S in Computer Science or a similar study
- 3+ years of people management experience leading DevOps or infrastructure engineering teams
- 7+ years of DevOps or software development experience
- 5+ years of experience running AWS / GCP / Azure cloud workloads at scale
- Strong hands-on experience with Terraform, Kubernetes, and Ansible
- Deeply familiar with operating and troubleshooting issues in a Linux environment
- Proven track record of building high-performing teams and developing engineering talent
- Experience with incident management, on-call rotations, and post-mortem processes
- Deeply familiar with DevOps and software development lifecycle best practices
- Strong written and verbal communication skills, including the ability to interface with senior leadership
- Comfortable leading a fully remote, distributed team across multiple time zones
Other skills that are great to bring with you but that we can help you develop:
- Experience in blockchain, web3, or distributed systems operations
- Familiarity with the LGTM stack and observability best practices
- Programming experience in Golang, Python, Bash, Java, or JavaScript
- Experience with Jenkins Pipelines, Github, and Github Actions
- Background in SRE principles and practices

