Get started with Chkk for free today! No credit card required
Learn more
Learn more
Back to the blog
Technology
February 14, 2023

Collective Learning: The Power of Not Repeating Others’ Mistakes

Written by
Ali Khayam
X logoLinkedin logo
Start for free
Estimated Reading time
3 min

..it is often the mistakes of others that benefit the rest of us.” Nassim Taleb, Antifragile.

Mistakes are easier to point out in hindsight but very difficult to discern when you are making them. But mistakes, if learned from systematically, can benefit entire industries from never repeating them. A great example is the aviation industry which meticulously documents every near-miss and propagates these learnings to future flights as pre/post/in-flight checklists. This discipline ensures that every mistake makes all future flights safer. Shouldn’t something like this exist for software systems as well? Wouldn’t it be great to have the wisdom from all the mistakes of other developers at your fingertips, all the time?

Curating and programmatically delivering the wisdom of the software industry appeared impossible just a few years back. But technology has evolved to the point where we can now dream to live in that world where we can build tools with the sole purpose of collectively learning from and avoiding each others’ mistakes. Imagine a world where you just got notified about a traffic glitch that you are at risk of hitting, with information on how many other teams have already hit it, and what is the safest way to remediate it, before it becomes an incident/firefight for your team. Isn’t that powerful!

That’s the promise of the Collective Learning technology that we are building at Chkk. Chkk’s Collective Learning technology ensures that SRE / DevOps engineers reduce risk, avoid incidents, streamline upgrades, automate maintenance, and prioritize work using the wisdom of their peers, programmatically delivered to them. We are starting with DevOps/SREs and our focus is K8s where Collective Learning is direly needed. There are 1800+ CNCF components. Throw at least 4 cloud providers in this mix. Add a conservative estimate of millions of software deployments every week. Then factor in infrastructure changes induced by load conditions (e.g. autoscaling) There are too many ways things can break and it’s impossible to capture all of these conditions proactively.  Instead of trying to cover this large state space of potential issues, Collective Learning discovers and codifies known incidents and failures that the K8s community (comprising users/operators, cloud providers, and vendors) has already experienced, ensuring nobody repeats a mistake that has been made before. Large enterprises extend Collective Learning to share custom/proprietary lessons within their organization so internal teams don’t repeat each others’ mistakes.

We believe Collective Learning is the missing piece in the K8s operations puzzle and we are so excited to be building it with you. Our follow-up blogs will share more insights into how we are building and evolving Collective Learning and what role you can play in this movement.

..it is often the mistakes of others that benefit the rest of us.” Nassim Taleb, Antifragile.

Mistakes are easier to point out in hindsight but very difficult to discern when you are making them. But mistakes, if learned from systematically, can benefit entire industries from never repeating them. A great example is the aviation industry which meticulously documents every near-miss and propagates these learnings to future flights as pre/post/in-flight checklists. This discipline ensures that every mistake makes all future flights safer. Shouldn’t something like this exist for software systems as well? Wouldn’t it be great to have the wisdom from all the mistakes of other developers at your fingertips, all the time?

Curating and programmatically delivering the wisdom of the software industry appeared impossible just a few years back. But technology has evolved to the point where we can now dream to live in that world where we can build tools with the sole purpose of collectively learning from and avoiding each others’ mistakes. Imagine a world where you just got notified about a traffic glitch that you are at risk of hitting, with information on how many other teams have already hit it, and what is the safest way to remediate it, before it becomes an incident/firefight for your team. Isn’t that powerful!

That’s the promise of the Collective Learning technology that we are building at Chkk. Chkk’s Collective Learning technology ensures that SRE / DevOps engineers reduce risk, avoid incidents, streamline upgrades, automate maintenance, and prioritize work using the wisdom of their peers, programmatically delivered to them. We are starting with DevOps/SREs and our focus is K8s where Collective Learning is direly needed. There are 1800+ CNCF components. Throw at least 4 cloud providers in this mix. Add a conservative estimate of millions of software deployments every week. Then factor in infrastructure changes induced by load conditions (e.g. autoscaling) There are too many ways things can break and it’s impossible to capture all of these conditions proactively.  Instead of trying to cover this large state space of potential issues, Collective Learning discovers and codifies known incidents and failures that the K8s community (comprising users/operators, cloud providers, and vendors) has already experienced, ensuring nobody repeats a mistake that has been made before. Large enterprises extend Collective Learning to share custom/proprietary lessons within their organization so internal teams don’t repeat each others’ mistakes.

We believe Collective Learning is the missing piece in the K8s operations puzzle and we are so excited to be building it with you. Our follow-up blogs will share more insights into how we are building and evolving Collective Learning and what role you can play in this movement.

Tags
Collective Learning

Continue reading

Spotlight

Spotlight: Simplifying Contour Upgrades with Chkk

by
Chkk Team
Read more
Hidden Toil

5 Reasons Why Delaying Open Source Software Upgrades Is a Bad Idea

by
Awais Nemat
Read more
Spotlight

Spotlight: Seamless cert-manager Upgrades with Chkk

by
Chkk Team
Read more
Spotlight

Spotlight: Argo Rollouts Upgrades with Chkk

by
Chkk Team
Read more
Upgrade Advisory

Upgrade Advisory: Pods Stuck in Pending During Kubelet v1.30 → v1.31 Upgrade

by
Chkk Team
Read more
Spotlight

Spotlight: Simplifying Self-Managed Apache Kafka Upgrades with Chkk

by
Chkk Team
Read more
Spotlight

Spotlight: Seamless Calico Upgrades with Chkk

by
Chkk Team
Read more
Spotlight

Spotlight: NGINX Ingress Controller Upgrades with Chkk

by
Chkk Team
Read more
Spotlight

Spotlight: KEDA Upgrades with Chkk

by
Chkk Team
Read more
Spotlight

Spotlight: Streamlining Prometheus Upgrades with Chkk

by
Chkk Team
Read more
Spotlight

Spotlight: RabbitMQ Upgrades with Chkk

by
Chkk Team
Read more
Spotlight

Spotlight: Seamless Kyverno Upgrades with Chkk

by
Chkk Team
Read more
News

Google Container Registry Deprecation 2025: How to Migrate to Artifact Registry

by
Chkk Team
Read more
Spotlight

Spotlight: HashiCorp Vault Upgrades with Chkk

by
Chkk Team
Read more
Spotlight

Spotlight: Streamlining Crossplane Upgrades with Chkk

by
Chkk Team
Read more
Spotlight

Spotlight: Seamless External DNS Upgrades with Chkk

by
Chkk Team
Read more
Case Study

How Dexcom Derisked GKE Upgrades and Sped Them Up by 5x using Chkk

by
Chkk Team
Read more
Case Study

Assuring Compliance and Availability for Yoti’s On-Prem Platform with Chkk

by
Chkk Team
Read more
Case Study

How a Fortune 500 Enterprise Avoided $500K in EKS Extended Support Fees, Achieved 80% Reduction in Prep Time, and Boosted Upgrade Productivity by 200%

by
Chkk Team
Read more
Case Study

How a Fortune 1000 Enterprise Standardized Multi-Cloud (EKS & GKE) Upgrades for 30+ Add-Ons, Avoided 6x Costs, and Achieved an 80% Reduction in Prep Time

by
Chkk Team
Read more
Spotlight

Spotlight: Upgrading Self-Managed Redis

by
Chkk Team
Read more
Spotlight

Spotlight: Simplifying Self-Managed Elasticsearch Upgrades with Chkk

by
Chkk Team
Read more
News

GKE & EKS Extended Support: Are 6x Fees for Supporting Older Kubernetes Versions Justified?

by
Ali Khayam
Read more
Spotlight

Spotlight: Seamless Karpenter Upgrades with Chkk

by
Chkk Team
Read more
Operational Safety

Forced EKS & GKE Upgrades: How to Manage Business Continuity Risks

by
Fawad Khaliq
Read more
Spotlight

Spotlight: How Chkk Streamlines & Safeguards Cilium Upgrades

by
Chkk Team
Read more
Technology

Kubernetes Admission Controllers and Webhooks Deep Dive

by
Chkk Team
Read more
Spotlight

Chkk Spotlight: Istio

by
Chkk Team
Read more
Technology

Pod Disruption Budgets: Pitfalls, Evictions & Kubernetes Upgrades

by
Chkk Team
Read more
Technology

cgroup v1 to v2 Migration in Kubernetes

by
Chkk Team
Read more
Operational Safety

OpenAI’s Outage: The Complexity and Fragility of Modern AI Infrastructure on Kubernetes

by
Fawad Khaliq
Read more
News

EKS launches Auto Mode… How can you adopt it?

by
Ali Khayam
Read more
Change Safety

CrowdStrike outage was the symptom; missing Operational Safety was the cause

by
Fawad Khaliq
Read more
News

GKE Follows EKS & AKS, Launches Extended Support with a 500% Surcharge for Delayed Upgrade

by
Ali Khayam
Read more
News

AKS Long Term Support and EKS Extended Support: Similarities & Differences

by
Ali Khayam
Read more
News

Amazon launches EKS extended support… How does it impact you?

by
Ali Khayam
Read more
Platform Engineering

Platform teams need a delightfully different approach, not one that sucks less

by
Fawad Khaliq
Read more
Technology

Kubernetes Enters Its Second Decade: Insights from KubeCon Chicago

by
Fawad Khaliq
Read more
Company

Launching Chkk Operational Safety Platform

by
Awais Nemat
Read more
Technology

What Makes Kubernetes Upgrades So Challenging?

by
Fawad Khaliq
Read more
Company

4 Lessons from our SOC2 Journey

by
Fawad Khaliq
Read more
Technology

Collective Learning: The Power of Not Repeating Others’ Mistakes

by
Ali Khayam
Read more
Technology

From Fighting Fires to Availability Assurance

by
Fawad Khaliq
Read more
Company

Welcome to Chkk

by
Awais Nemat
Read more