I’m a big believer in “wrapper scripts”.
K8S Tips and Scripts #1
I’ve often come across situations where pods won’t die properly. The underlying node may be in a strange state, there may be temporary networking issues, whatever. If a node has gone south, for example, pods will often stick in an “Evicted” state.
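As a rough illustration of the kind of wrapper this points toward, here is a minimal sketch that picks evicted pods out of the JSON produced by `kubectl get pods --all-namespaces -o json` and builds the matching `kubectl delete pod` commands. It assumes evicted pods report `status.phase` of `Failed` with `status.reason` of `Evicted`; the function names `evicted_pods` and `delete_commands` are hypothetical and not taken from the original post.

```python
import json


def evicted_pods(pod_list):
    """Return (namespace, name) pairs for pods marked as evicted.

    `pod_list` is the parsed JSON from `kubectl get pods -o json`.
    Assumption: an evicted pod has phase "Failed" and reason "Evicted".
    """
    found = []
    for pod in pod_list.get("items", []):
        status = pod.get("status", {})
        if status.get("phase") == "Failed" and status.get("reason") == "Evicted":
            meta = pod["metadata"]
            found.append((meta.get("namespace", "default"), meta["name"]))
    return found


def delete_commands(pods):
    """Build one `kubectl delete pod` argument list per (namespace, name) pair."""
    return [
        ["kubectl", "delete", "pod", name, "--namespace", namespace]
        for namespace, name in pods
    ]


def commands_from_json(text):
    """Convenience helper: raw kubectl JSON text in, printable command lines out."""
    return [" ".join(cmd) for cmd in delete_commands(evicted_pods(json.loads(text)))]
```

The sketch only builds the commands rather than running them, so the list can be reviewed before anything is deleted.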
Where Are We Going?
“Where are we going?” my wife asks as we leave the runway on her first flight with me after I received my pilot license.
Finding the Information You Need
Nothing sucks the enthusiasm out of a job faster than a frustrating search for the information you need that turns up empty or irrelevant. Even worse is when the information you find is outdated or even lies to you!
Thoughts on Documentation
When I worked at RIM (Blackberry), we had the luxury of having technical writers on staff. They were brilliant at yanking information out of engineering and organizing it into cohesive sets of documentation. Not all companies or departments can afford good technical writers on staff, so we all need to pitch in and do it. What constitutes good technical writing?
K8S Tips and Scripts #0
It’s time to start writing again.. It’s been a while..
I’ll start by sharing some tips I’ve learned and scripts I’ve written for dealing with Kubernetes.
Command Line Interface Guidelines
Anyone who knows me knows that I am most comfortable and at home on the Unix/Linux command line.
The Tail at Scale
The landmark “Tail at Scale” article was missing some of the math. We’re diving into it a bit here to show how the math can be used in setting objectives for latency budgets in back end systems.
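To give a flavour of that math, here is a small sketch of the basic fan-out arithmetic: if each back end is slow with probability p, and a request must wait on n back ends, the chance that at least one is slow is 1 - (1 - p)^n. This assumes the back ends are slow independently of each other, which real systems only approximate. The function names are hypothetical, and the second function simply inverts the first to suggest a per-back-end budget.

```python
def p_slow_fanout(p_slow, fanout):
    """Probability that at least one of `fanout` back-end calls is slow.

    Assumption: each call is slow independently with probability `p_slow`.
    """
    return 1.0 - (1.0 - p_slow) ** fanout


def per_backend_budget(target, fanout):
    """Largest per-call slow probability that keeps the whole request
    slow no more than `target` of the time, under the same independence
    assumption (the inverse of p_slow_fanout)."""
    return 1.0 - (1.0 - target) ** (1.0 / fanout)


if __name__ == "__main__":
    # A back end that is slow 1% of the time, called 100 times per request.
    print(round(p_slow_fanout(0.01, 100), 3))  # prints 0.634
```

Read the other way around, keeping a 100-way fan-out request slow only 1% of the time needs each back end to be slow roughly 0.01% of the time, which is why latency objectives for back ends end up far tighter than the objective for the front-end request.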
What is SRE?
The current state of confusion around what a “Site Reliability Engineer” (SRE) role is..
BPF Performance Tools
BPF is one of the Swiss Army Knife tools for Performance Engineering on Linux.
Event Logs and A.I.
Many companies in the logging/monitoring space will try to sell you on AI and ML (Artificial Intelligence and Machine Learning) to find abnormal.
Event Logs and K.I.S.S.
I’ve worked with event logs for, well, decades. There are quite a few companies that offer services for managing logs and, as far as I know, only a few are doing it right.
SPOFs and Partial Panel
In both aviation and systems we build in redundancies wherever practical to avoid unpleasantness when components or subsystems fail.
Traffic At 2 O’clock!
Up in the air, your eyes can’t be everywhere, all the time. You’re trained to scan the skies for “traffic” (other flying machines) as well as scanning instrumentation in the cockpit.
Own It !!
We were heading back from the practice area to the airport. I didn’t have my pilot license yet and my instructor says: “Push the throttle to Rental Speed!”
Reading Week #3
Here are some interesting reads if you’re fortunate enough to have some extra time off this Holiday Season..
Reading Week #1
I’d like to start a new series of posts, each collecting interesting articles to read for the week..
A Steal on O’Reilly DevOps/SRE Books!
This is a limited time deal on 15 O’Reilly books for $15. Go. Buy. Right. Now!
All Day DevOps 2018
I was lucky to catch most of the SRE track and Keynote speakers at the All Day DevOps event this year. Fortunately, if you missed it or want to watch some of the other tracks, the videos have been made available.
One Thing At A Time..
We want to learn things from any idea, test, change, upgrade or (heaven forbid) outage in production..
This book just arrived this morning and I’m just through the chapter on building SRE teams. Continue reading “Seeking SRE”
We called our albino squirrel in the backyard, “Snowflake”..
Operations in the Cloud
As an SRE, I’m very fortunate to have had training as a pilot. There are many similarities to system operations..
Must Have Books.. Another One!
Not just “Must Have”, but “Must Read!”. A new book has been released and is available, free to download for a short time.
vi or emacs? Really?!?
Most of the operations/engineering folks I’ve come into contact with will proclaim themselves “vi” people and yet, when I watch them edit a file, I cringe..
CI/CD and Optimization
When we talk of CI/CD we’re often referring to Continuous Integration and Delivery while Optimization refers to Services/Systems. What I’d like to discuss is Constant Improvement/Continuous Development and Self-Optimization..
The DevOps Alternative
In a previous article, “There’s Always a Problem”, I described situations that can arise with the old “Engineering vs. Operations” way. The new way is a DevOps culture..