<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Posts on Company Engineering</title><link>https://company-engineering.pages.dev/posts/</link><description>Recent content in Posts on Company Engineering</description><generator>Hugo</generator><language>en</language><lastBuildDate>Wed, 10 Jun 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://company-engineering.pages.dev/posts/index.xml" rel="self" type="application/rss+xml"/><item><title>Observability Patterns for Distributed Systems</title><link>https://company-engineering.pages.dev/posts/observability-patterns-for-distributed-systems/</link><pubDate>Wed, 10 Jun 2026 00:00:00 +0000</pubDate><guid>https://company-engineering.pages.dev/posts/observability-patterns-for-distributed-systems/</guid><description>&lt;p&gt;When a request crosses a dozen services, &amp;ldquo;check the logs&amp;rdquo; stops being useful
advice. Observability is what lets us ask new questions about a running system
without shipping new code to answer them. These are the patterns we rely on.&lt;/p&gt;
&lt;h2 id="the-three-signals-and-what-each-is-for"&gt;The three signals, and what each is for&lt;/h2&gt;
&lt;p&gt;We treat metrics, logs, and traces as complementary rather than interchangeable:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Metrics&lt;/strong&gt; answer &lt;em&gt;how much&lt;/em&gt; and &lt;em&gt;how often&lt;/em&gt; — and they&amp;rsquo;re cheap enough to
keep for everything. They&amp;rsquo;re how we notice a problem.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Traces&lt;/strong&gt; answer &lt;em&gt;where&lt;/em&gt; — they show the path of a single request across
services and where the time went. They&amp;rsquo;re how we localize a problem.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Logs&lt;/strong&gt; answer &lt;em&gt;why&lt;/em&gt; — the detailed, contextual record of what a specific
component did. They&amp;rsquo;re how we explain a problem.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You reach for them in that order: a metric alerts, a trace narrows it to a
service, logs explain the failure.&lt;/p&gt;</description></item><item><title>What We Learned From Running Background Workers in Production</title><link>https://company-engineering.pages.dev/posts/what-we-learned-from-running-background-workers-in-production/</link><pubDate>Fri, 05 Jun 2026 00:00:00 +0000</pubDate><guid>https://company-engineering.pages.dev/posts/what-we-learned-from-running-background-workers-in-production/</guid><description>&lt;p&gt;Background workers are where a lot of our most important work happens — sending
notifications, generating exports, syncing data, billing. They&amp;rsquo;re also where the
most surprising production incidents start. Here&amp;rsquo;s what running them at scale has
taught us.&lt;/p&gt;
&lt;h2 id="jobs-are-not-functions"&gt;Jobs are not functions&lt;/h2&gt;
&lt;p&gt;A function call either returns or throws. A background job can also be retried,
duplicated, delayed for hours, killed mid-execution, or run on code that has
since been deployed over. Designing jobs means designing for all of those states,
not just success and failure.&lt;/p&gt;</description></item><item><title>Designing Reliable Data Synchronization at Scale</title><link>https://company-engineering.pages.dev/posts/designing-reliable-data-synchronization-at-scale/</link><pubDate>Mon, 01 Jun 2026 00:00:00 +0000</pubDate><guid>https://company-engineering.pages.dev/posts/designing-reliable-data-synchronization-at-scale/</guid><description>&lt;p&gt;Keeping data consistent across systems we don&amp;rsquo;t control is one of the hardest
problems we work on. Every external integration is a small distributed system:
networks fail, third-party APIs rate-limit us, and the &amp;ldquo;source of truth&amp;rdquo; is
often whichever side wrote last. This post covers the patterns we lean on to
make synchronization predictable rather than heroic.&lt;/p&gt;
&lt;h2 id="the-shape-of-the-problem"&gt;The shape of the problem&lt;/h2&gt;
&lt;p&gt;A sync flow looks deceptively simple: read from a source, transform, write to a
destination. In practice each step can fail independently, and the same change
can arrive more than once. We design every flow around three assumptions:&lt;/p&gt;</description></item></channel></rss>