Tinybird is seeking a Site Reliability Engineer to help scale and maintain the reliability and elasticity of their software and infrastructure.
The role involves participating in the on-call team to understand the product and client issues.
The technology stack includes Linux, OpenResty for SSL termination and load balancing, Varnish for load balancing and caching, Redis as a metadata store, Python for backend development, ClickHouse as the main data store, and Zookeeper for ClickHouse replicas coordination.
Monitoring and alerting are managed using Grafana, Loki, and Mimir.
The engineer will use Terraform for cloud provisioning and Ansible for software deployment and configuration.
Responsibilities include ensuring efficient use of hardware resources, automating machines, and collaborating with the product team to design and develop system architecture.
Key challenges include improving high-availability and elasticity, enhancing observability, and refining disaster recovery processes.
The role also involves automating customer account upgrades to allow for dynamic resource allocation.
Requirements:
Candidates must have experience designing, building, and running distributed cloud architectures and large-scale web applications.
Strong programming skills are required, with a willingness to engage with the codebase, particularly in Python and C++.
The ideal candidate should be accountable, enthusiastic, and unafraid to take responsibility for platform management.
A bias for action, iteration, and delivery is essential, with an understanding that decisions can often be reversed quickly.
Candidates should think in terms of systems and be aware of edge cases, failure modes, and specific implementations.
Comfort with asynchronous collaboration and direct daily communication within the team is necessary.
The ability to build intuitive and maintainable software, along with documenting key insights, is important.
Experience with OpenResty, Varnish, Redis, Terraform, or Ansible is preferred, but candidates are encouraged to recommend the right technologies for challenges.
Experience with ClickHouse and rolling out database systems at scale is a significant plus.
Benefits:
The salary range for this position is €62,000 - €109,000 per year, including cash compensation and stock options.
The final offer will consider factors such as experience, qualifications, skills, business needs, and geographical location.
Employees receive 22 days of holiday per year, plus their birthday and public holidays.
Comprehensive health benefits are provided.
There is freedom to work from any location, with up to €2,400 available to help set up a home workspace.
Tinybird promotes a fully remote work culture, with opportunities for face-to-face interactions at their offices in Madrid and New York City.
The company values transparency and keeps employees informed about ongoing developments.