Skip to content
de. ← All Projects
  • Home
  • Projects
  • About
  • Resume
Contact
Open to roles
← All Projects

Auto-Ops Dashboard.

An operations surface for cluster observability that improves triage speed and decision quality under pressure.

Company Opster
Timeline 4 weeks
Team
• Product Manager • Product Designer • Front-End Lead • Back-End Lead • DevOps (user)
Project Summary

A centralized dashboard for cluster observability that helps operations teams identify high-priority issues faster and act with stronger context.

Role Overview

Led UX strategy and product design delivery across discovery, information architecture, dashboard interactions, and implementation-ready specs.

Design Process.

How we moved from a fuzzy problem to a shipped solution.

Problem framing
Alert overload made prioritization unclear

Operators were switching across tools to correlate health signals, which delayed triage and reduced confidence in next-best actions.

Project Goal
Consolidate observability signals into one decisive surface

Build a dashboard that highlights urgency, clarifies causality, and supports quick drill-down actions under time pressure.

Research & Takeaways

What users were actually saying

We analyzed monitoring workflows and incident handoffs to map bottlenecks in triage. Teams needed clearer prioritization, tighter context, and less visual noise.

Incident triage workflow and research synthesis map
Caption: Incident triage workflow synthesis
Wireframes

Exploring the solution space

Wireframes tested hierarchy and triage-first patterns for alert context, risk scoring, and actionability. We validated the fastest route from signal to decision.

Auto-Ops Dashboard wireframe exploration 1
Auto-Ops Dashboard wireframe exploration 2
Auto-Ops Dashboard wireframes overview
Results

What shipped and what moved

The dashboard improved issue prioritization and reduced navigation overhead by centralizing critical observability context. Mean time to detection dropped from ~18 minutes to under 2 minutes, cutting alert fatigue by 35%. Severity-based triage surfaced the right incidents without manual filtering, saving SRE teams 3–4 hours daily. Incidents resolved 45–70% faster through contextual runbooks and copy-ready remediation commands. Multi-cluster visibility consolidated into a single pane, reducing downtime from ~2.3 hours/month to under 25 minutes.

Auto-Ops Dashboard shipped UI — hero
Auto-Ops Dashboard shipped UI — detail 1
Auto-Ops Dashboard shipped UI — detail 2
Auto-Ops dashboard — critical event detail with timeline, detection summary, and remediation recommendations
Auto-Ops dashboard — events-over-time heatmap, all events list with severity, and resources plus performance metrics
Next project
Pipeline Builder for Scientists
→
David Eskenazi portrait
David Eskenazi
Senior Product Designer · AI and B2B SaaS
daveesk9@gmail.com 058.4986.123

© 2026 David Eskenazi. All rights reserved.

✶ 100% vibe coded site
Home · Projects · About · Resume
Project preview
Home About
Projects
Résumé Email