<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Observability on Company Engineering</title><link>https://company-engineering.pages.dev/tags/observability/</link><description>Recent content in Observability on Company Engineering</description><generator>Hugo</generator><language>en</language><lastBuildDate>Wed, 10 Jun 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://company-engineering.pages.dev/tags/observability/index.xml" rel="self" type="application/rss+xml"/><item><title>Observability Patterns for Distributed Systems</title><link>https://company-engineering.pages.dev/posts/observability-patterns-for-distributed-systems/</link><pubDate>Wed, 10 Jun 2026 00:00:00 +0000</pubDate><guid>https://company-engineering.pages.dev/posts/observability-patterns-for-distributed-systems/</guid><description>&lt;p&gt;When a request crosses a dozen services, &amp;ldquo;check the logs&amp;rdquo; stops being useful
advice. Observability is what lets us ask new questions about a running system
without shipping new code to answer them. These are the patterns we rely on.&lt;/p&gt;
&lt;h2 id="the-three-signals-and-what-each-is-for"&gt;The three signals, and what each is for&lt;/h2&gt;
&lt;p&gt;We treat metrics, logs, and traces as complementary rather than interchangeable:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Metrics&lt;/strong&gt; answer &lt;em&gt;how much&lt;/em&gt; and &lt;em&gt;how often&lt;/em&gt; — and they&amp;rsquo;re cheap enough to
keep for everything. They&amp;rsquo;re how we notice a problem.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Traces&lt;/strong&gt; answer &lt;em&gt;where&lt;/em&gt; — they show the path of a single request across
services and where the time went. They&amp;rsquo;re how we localize a problem.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Logs&lt;/strong&gt; answer &lt;em&gt;why&lt;/em&gt; — the detailed, contextual record of what a specific
component did. They&amp;rsquo;re how we explain a problem.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You reach for them in that order: a metric alerts, a trace narrows it to a
service, logs explain the failure.&lt;/p&gt;</description></item></channel></rss>