Closes in 2 days

Engineering Manager, Infrastructure Team

Czechia CompetitiveRemote0 applicants

About this role

Bloomreach is building the world’s premier agentic platform for personalization.We’re revolutionizing how businesses connect with their customers, building and deploying AI agents to personalize the entire customer journey.

We're taking autonomous search mainstream, making product discovery more intuitive and conversational for customers, and more profitable for businesses.

We’re making conversational shopping a reality, connecting every shopper with tailored guidance and product expertise — available on demand, at every touchpoint in their journey.

We're designing the future of autonomous marketing, taking the work out of workflows, and reclaiming the creative, strategic, and customer-first work marketers were always meant to do.

And we're building all of that on the intelligence of a single AI engine — Loomi AI — so that personalization isn't only autonomous…it's also consistent.From retail to financial services, hospitality to gaming, businesses use Bloomreach to drive higher growth and lasting loyalty. We power personalization for more than 1,400 global brands, including American Eagle, Sonepar, and Pandora.

Become an Engineering Manager at Bloomreach and lead our Infrastructure team as a core foundation for product engineering. In this role, you will own the platform capabilities that power personalization across our clients’ websites and mobile apps, enabling fast experimentation and multi-variant testing at scale.

You will lead the team responsible for our core infrastructure stack - Google Cloud Platform (GCP), databases, observability platform, and Kubernetes, and partner closely with Product, Security, and application engineering teams to ensure our platform is reliable, scalable, secure, cost-efficient, and developer-friendly.

Your leadership and strategic direction will impact hundreds of millions of end customers across diverse e-commerce verticals. This is a full-time role based in one of our Central European offices (Bratislava, Brno, Prague) or remote.

What You’ll Do

Lead and grow a high-performing Infrastructure Engineering team (hiring, mentoring, performance, career development).

Organize team activities and delivery rhythm - lead standups, sprint planning, backlog refinement, retrospectives, and other rituals that keep the team aligned, unblock work early, and continuously improve how we operate.

Own infrastructure strategy and execution across:

GCP services and cloud architecture

Kubernetes platform operations and evolution

Database reliability, performance, and lifecycle management

Observability platform (metrics, logs, traces, alerting, dashboards)

Act as a foundation team leader for application teams by providing:

cloud architecture expertise and consultation,

shared infrastructure standards and best practices,

reusable platform tooling and self-service capabilities.

Drive reliability and operational excellence through SLOs/SLIs, incident management, postmortems, and automation.

Establish strong partnership with product and application engineering teams to improve developer experience and delivery speed.

Collaborate with Security and Compliance teams to enforce best practices for access control, data protection, and auditability.

Own infrastructure financial governance, including:

cost monitoring and visibility across cloud resources,

cost optimization initiatives and accountability with engineering teams.

Key Responsibilities by Domain

GCP

Define and maintain cloud architecture standards, IAM model, and network/security guardrails.

Improve scalability, resilience, and cost efficiency of GCP-based services.

Implement cloud cost monitoring practices (tagging, allocation, dashboards, anomaly detection).

Kubernetes Platform

Own Kubernetes cluster strategy, upgrades, policy enforcement, and platform reliability.

Standardize deployment patterns and improve CI/CD integration for application teams.

Databases

Ensure availability, performance, backup/restore, and disaster recovery for production databases.

Guide database capacity planning, migration strategies, and operational best practices.

Observability

Build and maintain a unified observability stack for logs, metrics, traces, and alerting.

Ensure teams can detect, troubleshoot, and resolve incidents quickly with actionable telemetry.

Foundation Enablement

Provide expert guidance and platform tooling that enables application teams to operate effectively on cloud infrastructure.

Promote infrastructure best practices through documentation, training, and hands-on support.

What We’re Looking For

5+ years in software/platform/infrastructure engineering, with 2+ years in people leadership.

Proven experience managing infrastructure teams in cloud-native environments.

Strong hands-on understanding of:

GCP services (compute, networking, IAM, storage, monitoring),

Kubernetes in production,

database operations (SQL/NoSQL, replication, backup/DR, tuning),

observability tooling and SRE practices.

Experience building foundation/platform capabilities for multiple engineering teams.

Experience with cloud cost monitoring, budget ownership, and cost optimization at scale.

Strong communication and stakeholder management skills across technical and non-technical audiences.

Ability to set strategy while remaining pragmatic and execution-focused.

Responsibilities

  • We're taking autonomous search mainstream, making product discovery more intuitive and conversational for customers, and more profitable for businesses.
  • We’re making conversational shopping a reality, connecting every shopper with tailored guidance and product expertise — available on demand, at every touchpoint in their journey.
  • We're designing the future of autonomous marketing, taking the work out of workflows, and reclaiming the creative, strategic, and customer-first work marketers were always meant to do.
  • Lead and grow a high-performing Infrastructure Engineering team (hiring, mentoring, performance, career development).
  • Organize team activities and delivery rhythm - lead standups, sprint planning, backlog refinement, retrospectives, and other rituals that keep the team aligned, unblock work early, and continuously improve how we operate.
  • Own infrastructure strategy and execution across:
  • GCP services and cloud architecture
  • Kubernetes platform operations and evolution
  • Database reliability, performance, and lifecycle management
  • Observability platform (metrics, logs, traces, alerting, dashboards)
  • Act as a foundation team leader for application teams by providing:
  • cloud architecture expertise and consultation,
  • shared infrastructure standards and best practices,
  • reusable platform tooling and self-service capabilities.
  • Drive reliability and operational excellence through SLOs/SLIs, incident management, postmortems, and automation.
  • Establish strong partnership with product and application engineering teams to improve developer experience and delivery speed.
  • Collaborate with Security and Compliance teams to enforce best practices for access control, data protection, and auditability.
  • Own infrastructure financial governance, including:
  • cost monitoring and visibility across cloud resources,
  • cost optimization initiatives and accountability with engineering teams.
  • Define and maintain cloud architecture standards, IAM model, and network/security guardrails.
  • Improve scalability, resilience, and cost efficiency of GCP-based services.
  • Implement cloud cost monitoring practices (tagging, allocation, dashboards, anomaly detection).
  • Own Kubernetes cluster strategy, upgrades, policy enforcement, and platform reliability.
  • Standardize deployment patterns and improve CI/CD integration for application teams.
  • Ensure availability, performance, backup/restore, and disaster recovery for production databases.
  • Guide database capacity planning, migration strategies, and operational best practices.
  • Build and maintain a unified observability stack for logs, metrics, traces, and alerting.
  • Ensure teams can detect, troubleshoot, and resolve incidents quickly with actionable telemetry.
  • Provide expert guidance and platform tooling that enables application teams to operate effectively on cloud infrastructure.
  • Promote infrastructure best practices through documentation, training, and hands-on support.
  • 5+ years in software/platform/infrastructure engineering, with 2+ years in people leadership.
  • Proven experience managing infrastructure teams in cloud-native environments.
  • Strong hands-on understanding of:
  • GCP services (compute, networking, IAM, storage, monitoring),

Requirements

  • Kubernetes in production,
  • database operations (SQL/NoSQL, replication, backup/DR, tuning),
  • observability tooling and SRE practices.
  • Experience building foundation/platform capabilities for multiple engineering teams.
  • Experience with cloud cost monitoring, budget ownership, and cost optimization at scale.
  • Strong communication and stakeholder management skills across technical and non-technical audiences.
  • Ability to set strategy while remaining pragmatic and execution-focused.
  • Experience with Infrastructure as Code (Terraform) and policy-as-code.
  • Experience with service mesh, platform APIs, or internal developer platforms.
  • Familiarity with SOC2/ISO27001/GDPR-related infrastructure controls.
  • Experience supporting multi-region or global-scale systems.
  • First 30 days: Understand architecture, team dynamics, and reliability/cost baseline; establish team operating cadence.
  • First 90 days: Deliver improvements in platform reliability and observability coverage; introduce clear cloud cost visibility and ownership.
  • First 180 days: Improve reliability and deployment efficiency metrics, reduce operational toil, and achieve measurable infrastructure cost optimization.

Nice to have

  • Experience with Infrastructure as Code (Terraform) and policy-as-code.
  • Experience with service mesh, platform APIs, or internal developer platforms.
  • Familiarity with SOC2/ISO27001/GDPR-related infrastructure controls.
  • Experience supporting multi-region or global-scale systems.
  • Success in This Role
  • First 30 days: Understand architecture, team dynamics, and reliability/cost baseline; establish team operating cadence.
  • First 90 days: Deliver improvements in platform reliability and observability coverage; introduce clear cloud cost visibility and ownership.
  • First 180 days: Improve reliability and deployment efficiency metrics, reduce operational toil, and achieve measurable infrastructure cost optimization.

EU Requirements

Job Details

Posted14 May 2026
Closes13 June 2026
Work ModeRemote

Contact

Similar Jobs

Finding similar jobs...