Systems & Monitoring Engineer

Festanstellung, Vollzeit · India, Remote

Your mission

The Hashgraph Group (THG) is a global organization headquartered in Switzerland, and is a part of the Hedera Hashgraph (“Hedera”) ecosystem.

Hedera is a revolutionary proof-of-stake public Distributed Ledger Technology (DLT) network that is fast emerging as the gold standard in DLT for enterprise-grade solutions and decentralized applications (dApps). Hedera is governed by a council of the world’s leading organizations - which include Google, Boeing, IBM, Dell, Deutsche Telekom, LG, Abrdn, London School of Economics, to name a few.

THG works closely with enterprises, startups, governments, and academic and training institutions around the world to deliver financing, custom-design solutions, and professional training and innovation programs, aimed at accelerating the development and utilization of the Hedera Hashgraph network.

Your profile

About the Role:

Are you passionate about next-gen observability, automation, and operational excellence? As our Systems & Monitoring Engineer, you’ll architect and own the monitoring stack for our Hedera-based ecosystem, blending classic NOC best practices with the unique challenges of DLT and Web3. You’ll be the technical backbone ensuring uptime, resilience, and regulatory compliance for our global support teams.

What You’ll Do

1) Web3 Observability:

Design, deploy, and maintain monitoring solutions (Prometheus, Grafana) for DLT-specific metrics (consensus finality, node health, on-chain activity).
Build custom exporters and dashboards for real-time, actionable insights.
Distinguish between infrastructure and protocol health to ensure meaningful alerts.

2) Incident Response & Compliance:

Integrate and manage PagerDuty for rapid, automated incident response.
Implement DORA-compliant processes, including automated “kill switches” and regular disaster recovery drills.
Maintain clear, actionable runbooks for support teams.

3) Automation & Infrastructure as Code:

Deploy and manage Mirror Nodes and RPC relays using Terraform/Ansible across AWS/GCP.
Build CI/CD pipelines for support tooling and state proof verification.
Automate critical response actions for rapid threat mitigation.

4) NOC Leadership:

Serve as the L3 escalation point for complex incidents (“ghost transactions,” API anomalies).
Perform root cause analysis using logs (Splunk, Datadog) and collaborate with cross-functional teams.

What You Bring

4+ years in DevOps, SRE, or NOC roles (with 1–2 years in Web3/Blockchain environments).
Deep expertise in Prometheus/Grafana, Linux, Docker/Kubernetes, and scripting (Python, Go, Bash).
Proven experience with cloud platforms (AWS/GCP) and IaC tools (Terraform).
Strong understanding of Hedera Hashgraph or EVM-based chains, and ability to interpret ledger APIs.
Familiarity with ITIL/ITSM, DORA, SOC2, or ISO 27001 frameworks.

Why us?

What we offer

A unique opportunity to be a part of the world’s leading DLT ecosystem
Significant career growth potential in a fast growing sector
Working with colleagues and on projects across the globe
Open and direct communication, flat structures
Flexible working hours
Competitive salary package

Auf diese Stelle bewerben

About us

https://www.hashgraph-group.com/

Auf diese Stelle bewerben

Wir freuen uns auf Sie!

Wir freuen uns über Ihr Interesse an der Demo Daten GmbH. Bitte füllen Sie das folgende kurze Formular aus. Sollten Sie Schwierigkeiten mit dem Upload Ihrer Daten haben, wende Sie sich gerne per Email an demodaten@demo.de.