of which should have a focus on observability. Excellent knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, VictoriaMetrics, Grafana, Datadog, New Relic, OpenTelemetry, ELK Stack, or similar. Experience with high-volume data storage (structured and unstructured). A strong technical background, with current capabilities and …
skills. Strong interpersonal skills. Desired Criteria: Experience working with modern CI/CD and engineering tooling - we leverage Amazon Web Services, GitHub Actions, Datadog and Terraform extensively. Experience with microservice-oriented architectures. Experience with agile methodologies. Experience with TypeScript. Experience with PostgreSQL (or similar) and ORM frameworks. Why join …
models and related infrastructure. Monitoring and Observability: Build and maintain comprehensive monitoring and alerting systems for our ML infrastructure and models, leveraging tools like Datadog to ensure system health and performance. Collaboration and Mentorship: Collaborate effectively with data scientists, engineers, and other stakeholders. Provide guidance and support to junior team … CD pipelines. Containerization and Orchestration: Knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes). Monitoring and Logging: Experience with monitoring and logging tools like Datadog, Prometheus, or Grafana. Data Engineering Skills: Knowledge of event streaming platforms (e.g., Apache Kafka) and SQL database management. Strong Communication and Collaboration: Excellent communication skills …