Skip to content

bill duncan's blog

Performance Matters (and Other Tidbits from the Trenches)

  • Home
  • About Me
  • Contact bill duncan
  • Links
bill duncan's blog

Author: bduncan

Reading Week #2

First of all, Merry Christmas if you celebrate it, Happy Holidays if you don’t! This week’s interesting read is about a subject I love.. Continue reading “Reading Week #2”

Author bduncanPosted on 2018-12-232018-12-23Categories Detecting Performance Issues, Problem Solving, Reading Week, SRE, System PerformanceTags log files

Reading Week #1

I’d like to start a new series of articles based on interesting articles to read for the week.. Continue reading “Reading Week #1”

Author bduncanPosted on 2018-12-142018-12-14Categories devops, Reading Week, SRE

WordCamp US 2018

#WCUS happening this weekend. WordPress 5.0 and Gutenberg news.. Continue reading “WordCamp US 2018”

Author bduncanPosted on 2018-12-072018-12-12Categories OT

Last Week’s Security Earthquake

Last week there were two earthshaking security events. Yes, the Marriott data breach was big, but I’d like to talk about the one you might not have heard of.. Continue reading “Last Week’s Security Earthquake”

Author bduncanPosted on 2018-12-052018-12-06Categories Security

Off Topic #1

This is off-topic, but I thought I’d share an image from a recent vacation. Continue reading “Off Topic #1”

Author bduncanPosted on 2018-12-032018-12-04Categories OT

Checklists and Runbooks

We’ve been flying planes much longer than we’ve been running systems in production, so it might be instructive to learn what we can from our fellow aviators.. Continue reading “Checklists and Runbooks”

Author bduncanPosted on 2018-11-272022-08-26Categories Aviation, Documentation, SRETags Aviation, Checklists, Runbooks, tech documentation, technical writing

A Steal on O’Reilly DevOps/SRE Books!

This is a limited time deal on 15 O’Reilly books for $15. Go. Buy. Right. Now! Continue reading “A Steal on O’Reilly DevOps/SRE Books!”

Author bduncanPosted on 2018-11-07Categories devops, Documentation, RTFM, SRETags O'Reilly Books2 Comments on A Steal on O’Reilly DevOps/SRE Books!

Geolocating Your Users

I first became interested in geolocating a few decades ago while designing an email filtering system for some customers and noticing that most of the “spam and malware” originated in half a dozen countries.. Continue reading “Geolocating Your Users”

Author bduncanPosted on 2018-11-062018-11-06Categories Geolocating

All Day DevOps 2018

I was lucky to catch most of the SRE track and Keynote speakers with the All Day DevOps event this year. Fortunately, if you missed it or want to watch some of the other tracks, the videos have been made available.

Continue reading “All Day DevOps 2018”

Author bduncanPosted on 2018-10-192018-10-19Categories devops, SRE

One Thing At A Time..

We want to learn things from any idea, test, change, upgrade or (heaven forbid) outage in production..

Continue reading “One Thing At A Time..”

Author bduncanPosted on 2018-10-032018-10-19Categories devops, Diagnosing Issues, SRE

Seeking SRE

This book just arrived this morning and I’m just through the chapter on building SRE teams. Continue reading “Seeking SRE”

Author bduncanPosted on 2018-09-272018-09-27Categories devops, SRETags site reliability engineering, SRE1 Comment on Seeking SRE

Hiring Questions, Problem 2

While most technical hiring questions aren’t all that relevant, this one might be more generally useful. Find duplicate files; the trick was the speedup.. Continue reading “Hiring Questions, Problem 2”

Author bduncanPosted on 2018-09-082018-10-26Categories Code Bits, Just-for-Fun

From the Get Go!

Learning a new computer language can be fun! Stretching ourselves to think about problems in new ways.. Continue reading “From the Get Go!”

Author bduncanPosted on 2018-08-262018-08-26Categories Code Bits, golang, Just-for-Fun

Realtime Component Request Deficit

Looking for help naming (and finding other uses for) a novel technique in detecting grey failures.  Possible use cases are discussed here: load balancing, finding saturation points, alerting.. [ed. Decided on the name “Saturation Factor“.] Continue reading “Realtime Component Request Deficit”

Author bduncanPosted on 2018-08-192018-09-10Categories Detecting Performance Issues, System Performance5 Comments on Realtime Component Request Deficit

Solving the Right Problems

Ask the right questions, listen carefully and make sure that you’re not hearing just what you want to hear.. Continue reading “Solving the Right Problems”

Author bduncanPosted on 2018-08-122018-08-12Categories Problem Solving

Don’t Panic!

Some thoughts about handling critical system issues at scale.. Continue reading “Don’t Panic!”

Author bduncanPosted on 2018-08-102018-08-12Categories Diagnosing Issues, Problem Solving, SRE

Snowflakes

We called our albino squirrel in the backyard, “Snowflake”..

Continue reading “Snowflakes”

Author bduncanPosted on 2018-08-072018-08-08Categories devops, SRE

Operations in the Cloud

As an SRE, I’m very fortunate to have had training as a pilot. There are many similarities to system operations.. Continue reading “Operations in the Cloud”

Author bduncanPosted on 2018-08-062022-12-15Categories Aviation, devops, SRE1 Comment on Operations in the Cloud

Interactive bash Scripts

Building interactive commands that uses editing history and tab completion can be easy in bash and serve as a wrapper for automating tasks. Continue reading “Interactive bash Scripts”

Author bduncanPosted on 2018-07-292018-07-29Categories Code Bits, Just-for-Fun1 Comment on Interactive bash Scripts

Must Have Books.. Another One!

Not just “Must Have”, but “Must Read!”. A new book has been released and is available, free to download for a short time. Continue reading “Must Have Books.. Another One!”

Author bduncanPosted on 2018-07-252018-07-25Categories devops, RTFM, SRE2 Comments on Must Have Books.. Another One!

vi or emacs? Really?!?

Most of the operations/engineering folks I’ve come into contact with will proclaim to be “vi” people and yet, when I watch them edit a file I cringe.. Continue reading “vi or emacs? Really?!?”

Author bduncanPosted on 2018-07-212018-07-30Categories devops, RTFM1 Comment on vi or emacs? Really?!?

CI/CD and Optimization

When we talk of CI/CD we’re often referring to Continuous Integration and Delivery while Optimization refers to Services/Systems. What I’d like to discuss is Constant Improvement/Continuous Development and Self-Optimization.. Continue reading “CI/CD and Optimization”

Author bduncanPosted on 2018-07-082018-07-10Categories devops1 Comment on CI/CD and Optimization

The DevOps Alternative

In a previous article, “There’s Always a Problem”, I described situations that can arise with the “Engineering vs. Operations” old way. The new way is a DevOps culture.. Continue reading “The DevOps Alternative”

Author bduncanPosted on 2018-07-052018-07-17Categories devops

Preventing Trainwrecks

I keep a large framed photo of this on the wall in my office to remind me what can happen when things go “off the rails”.. Continue reading “Preventing Trainwrecks”

Author bduncanPosted on 2018-06-222018-06-27Categories Just-for-FunTags preventing disaster1 Comment on Preventing Trainwrecks

Hiring Questions, Problem 1

A colleague of mine once posted a hiring question to ask prospective developers: “What is the least significant 10 digits of the series: 1^{1}+2^{2}+3^{3} .. 1000^{1000} ?”

Continue reading “Hiring Questions, Problem 1”

Author bduncanPosted on 2018-06-212018-06-27Categories Code Bits, Just-for-FunTags awk, bash, LaTeX, modular exponentiation, scripting

Don’t Aggregate, Consolidate!

In previous posts, I’ve emphasized that averages are particularly bad at characterizing most things that you might be looking for. However, storing aggregated data of any type can limit your ability to analyze data later. Continue reading “Don’t Aggregate, Consolidate!”

Author bduncanPosted on 2018-06-202018-06-27Categories Detecting Performance IssuesTags monitoring, observability, performance

awk, the Often Ignored Little Language

Many people use awk for one-liners; picking out fields from logs, doing pattern matching. It’s capable of so much more however. IMO, the “littleness” of the language is one of it’s strengths. Continue reading “awk, the Often Ignored Little Language”

Author bduncanPosted on 2018-06-202018-08-30Categories Code Bits, Just-for-FunTags awk, recursion, sudoku1 Comment on awk, the Often Ignored Little Language

Bitrot, Part 2

This article has a link to a simple script I’ve used for over a decade to detect corrupted files. It will detect and report on files that have changed, been added, deleted or possibly moved within the same directory structure. Continue reading “Bitrot, Part 2”

Author bduncanPosted on 2018-06-192018-06-27Categories Code Bits, Data CorruptionTags bitrot, data corruption, filesystem

Bitrot, Part 1

Your systems have drives set up in RAID configurations and besides, you have data copied to redundant systems and backups, right? Safe? Maybe not. I recently found corruption in a quarter of a million files that had not previously been detected, for years! Continue reading “Bitrot, Part 1”

Author bduncanPosted on 2018-06-062018-06-09Categories Data CorruptionTags bitrot, data corruption, filesystem1 Comment on Bitrot, Part 1

There’s Always a Problem

Do you have insatiable curiosity and are driven by a relentless pursuit of the truth? You might make a great problem solver, but be careful how you deal with your findings! Continue reading “There’s Always a Problem”

Author bduncanPosted on 2018-06-042018-07-05Categories Detecting Performance Issues, Diagnosing Issues, System PerformanceTags diagnosing issues, engineering, latency, monitoring, observability, operations, performance, software stack

Posts pagination

Previous page Page 1 Page 2 Page 3 Next page

Recent Posts

  • openssl wrapper script
  • Anagrams for Fun
  • Good Luck and Happy New Year!

Archives

  • April 2023
  • January 2023
  • September 2022
  • August 2022
  • July 2021
  • February 2021
  • December 2020
  • July 2020
  • June 2020
  • May 2020
  • April 2020
  • March 2020
  • February 2020
  • December 2019
  • November 2019
  • October 2019
  • May 2019
  • January 2019
  • December 2018
  • November 2018
  • October 2018
  • September 2018
  • August 2018
  • July 2018
  • June 2018
  • May 2018
  • April 2018
  • March 2018

Categories

  • Aviation (7)
  • Code Bits (14)
  • Data Corruption (2)
  • Detecting Performance Issues (14)
  • devops (27)
  • Diagnosing Issues (9)
  • Documentation (5)
  • Event logs (2)
  • Geolocating (1)
  • golang (1)
  • Just-for-Fun (12)
  • K8S (2)
  • Monitoring (4)
  • OT (6)
  • Problem Solving (5)
  • R (2)
  • Reading Week (6)
  • RemoteWork (2)
  • RTFM (5)
  • Security (2)
  • SRE (32)
  • System Performance (13)
  • Uncategorized (1)
  • WFM (5)

Archives

  • April 2023
  • January 2023
  • September 2022
  • August 2022
  • July 2021
  • February 2021
  • December 2020
  • July 2020
  • June 2020
  • May 2020
  • April 2020
  • March 2020
  • February 2020
  • December 2019
  • November 2019
  • October 2019
  • May 2019
  • January 2019
  • December 2018
  • November 2018
  • October 2018
  • September 2018
  • August 2018
  • July 2018
  • June 2018
  • May 2018
  • April 2018
  • March 2018
  • Home
  • About Me
  • Contact bill duncan
  • Links
bill duncan's blog Proudly powered by WordPress