Halcyon is the industry’s first dedicated, adaptive security platform that combines multiple proprietary advanced prevention engines along with AI models focused specifically on stopping ransomware.
The company was formed in 2021 by a team of cyber industry veterans with experience at major global security vendors.
The role of Cloud Observability and Performance Engineer is part of the Chaos Cloud Engineering team.
The engineer will design and implement observability, monitoring, and performance strategies for cloud-hosted microservices managing endpoint security agents at scale.
Responsibilities include designing end-to-end observability for distributed cloud services, developing metrics pipelines and dashboards, ensuring high performance and availability of systems, collaborating with teams to troubleshoot issues, defining SLOs and SLIs, instrumenting code for metrics, automating performance testing, and providing root cause analysis.
Requirements:
Candidates must have 5+ years of professional experience in observability, site reliability, or cloud performance roles.
Strong experience with monitoring and observability stacks such as Prometheus, Grafana, ELK, OpenTelemetry, Datadog, and AWS CloudWatch is required.
Proficiency in cloud platforms like AWS, GCP, and Azure, as well as cloud-native services, is necessary.
Experience with containerization and orchestration tools like Docker and Kubernetes is essential.
Hands-on experience in designing and implementing performance or load testing frameworks for distributed systems is required.
Candidates should be able to define and validate throughput baselines and latency thresholds under real-world traffic scenarios.
Solid knowledge of distributed systems, microservices, and performance debugging is necessary.
Proficiency in Python, Scala, or other languages for tooling and automation is required.
Familiarity with CI/CD pipelines, infrastructure as code (e.g., Terraform), and version control (Git) is necessary.
Candidates must be able to participate in an on-call rotation to support observability infrastructure.
Benefits:
Halcyon offers comprehensive healthcare (medical, dental, and vision) with premiums paid in full for employees and dependents.
A 401k plan with a generous employer contribution is provided.
Short and long-term disability coverage, as well as basic life and AD&D insurance plans, are included.
Medical and dependent care FSA options are available.
The company has a flexible PTO policy and offers parental leave.
Employees receive a generous equity offering.
The base salary range for this position is $150,000 - $185,000, with a bonus target of 10%.