Join the team redefining how the world experiences design at Canva.
The role is open to remote work across Australia and New Zealand.
You will be responsible for building and improving the observability platform and tooling used by all Canva engineers.
Provide technical leadership and expertise to drive pragmatic solutions and achieve impactful design decisions.
Engage in brainstorming, researching, and prototyping to optimize tracing and exceptions platforms, improve operational effectiveness, and increase reliability.
Proactively improve the tracing user experience and advocate for best practices.
Find ways to enhance the use of traces and exceptions, providing better insights to engineers.
Enhance the exception workflow to help engineers capture errors, gain actionable insights through clear visualizations, and set up high-signal, low-noise alerts.
Participate in team ceremonies, knowledge sharing, and brainstorming sessions.
Become an observability champion, evangelizing best practices and guiding other Canvanauts in the observability space.
Requirements:
You must be proficient and comfortable coding in Python, Java, or Golang.
A deep knowledge and understanding of Computer Engineering fundamentals and first principles is required.
Solid knowledge of AWS services such as EC2, EKS, Lambda, SQS, Kinesis, and S3 or equivalent is necessary.
Experience deploying and running containerized workloads on platforms like Kubernetes is essential.
You should have experience with Observability Tooling, including tools like Elasticsearch, Grafana, Sentry, Jaeger Tracing, or similar.
Experience running highly available and reliable distributed systems with scalable data stores is required.
Proficiency with infrastructure-as-code is necessary; experience with Terraform is preferred, but strong experience with other IaC tools is acceptable.
Helpful but not essential experience includes familiarity with OpenTelemetry, writing application code in Java or frontend code in TypeScript, building and running monitoring infrastructure at scale, handling data at scale, and experience with Clickhouse.
Experience with data security, data obfuscation, and PII detection is also beneficial.
Benefits:
Equity packages are offered to ensure that your success aligns with the company's success.
An inclusive parental leave policy supports all parents and caregivers.
An annual Vibe & Thrive allowance is provided to support your wellbeing, social connection, office setup, and more.
Flexible leave options empower you to take time to recharge and support your personal needs.