sre-interview-prep-guide/README.md

188 lines
10 KiB
Markdown
Raw Normal View History

2019-01-20 16:08:28 +03:00
# Site Reliability Engineer (SRE) Interview Preparation Guide
2020-07-09 21:42:03 +03:00
[![HitCount](http://hits.dwyl.com/mxssl/sre-interview-prep-guide.svg)](http://hits.dwyl.com/mxssl/sre-interview-prep-guide)
2019-01-20 16:08:28 +03:00
This repository is an attempt to consolidate useful resources for Site Reliability Engineer (SRE) interview preparation.
## Basics
2020-02-02 01:28:02 +03:00
* Simple: [What happens when you type in www.cnn.com in your browser?](https://syedali.net/2013/08/18/what-happens-when-you-type-in-www-cnn-com-in-your-browser)
* Detailed: [What happens when you type google.com into your browser's address box and press enter?](https://github.com/alex/what-happens-when)
2019-01-20 16:08:28 +03:00
## Linux
### Boot Process
* [An introduction to the Linux boot and startup processes](https://opensource.com/article/17/2/linux-boot-and-startup)
* [What happens when we turn on computer?](https://www.cdn.geeksforgeeks.org/what-happens-when-we-turn-on-computer)
* [What happens when we turn on computer?](https://leetcode.com/discuss/interview-question/125107/What-happens-when-we-turn-on-computer)
* [From Power up to login prompt](http://www.scott-a-s.com/files/linux_boot.pdf)
2019-01-20 16:08:28 +03:00
2019-01-27 09:45:33 +03:00
### Filesystem
2019-01-20 16:08:28 +03:00
* [Understanding Inodes](https://syedali.net/2015/02/08/understanding-inodes)
* [Understand UNIX / Linux Inodes Basics with Examples](https://www.thegeekstuff.com/2012/01/linux-inodes)
* [Understanding proc filesystem](https://syedali.net/2013/08/20/understanding-proc-filesystem)
* [Common Mount Options](https://syedali.net/2015/01/06/common-mount-options)
* [Understanding Linux filesystems: ext4 and beyond](https://opensource.com/article/18/4/ext4-filesystem)
### Kernel
2019-01-29 03:04:44 +03:00
* [Explain the basics of Linux kernel](http://learnlinuxconcepts.blogspot.com/2014/03/explain-basics-of-linux-kernel.html)
* [Kernel Space and User Space](http://learnlinuxconcepts.blogspot.com/2014/02/kernel-space-and-user-space.html)
2019-01-20 16:08:28 +03:00
* [Linux Kernel Process Management](http://learnlinuxconcepts.blogspot.com/2014/03/process-management.html)
2019-01-29 03:04:44 +03:00
* [Linux Addressing](http://learnlinuxconcepts.blogspot.com/2014/02/linux-addressing.html)
2019-01-20 16:08:28 +03:00
* [Linux Kernel Memory Management](http://learnlinuxconcepts.blogspot.com/2014/02/linux-memory-management.html)
2019-01-29 03:04:44 +03:00
* [STACK AND HEAP](http://learnlinuxconcepts.blogspot.com/2014/02/stack-and-heap.html)
* [Paging and Segmentation](http://learnlinuxconcepts.blogspot.com/2014/02/paging-and-segmentation.html)
2019-01-20 16:08:28 +03:00
* [Linux Kernel System Calls](http://learnlinuxconcepts.blogspot.com/2014/02/system-calls.html)
2019-01-29 03:04:44 +03:00
* [The Virtual Filesystem](http://learnlinuxconcepts.blogspot.com/2014/10/the-virtual-filesystem.html)
* [Concurrency and Race Conditions](http://learnlinuxconcepts.blogspot.com/2014/07/concurrency-and-race-conditions.html)
* [Memory Leak](https://stackoverflow.com/questions/312069/the-best-memory-leak-definition)
* [What is a kernel Panic?](http://learnlinuxconcepts.blogspot.com/2014/07/what-is-kernel-panic.html)
2019-01-20 16:08:28 +03:00
### Troubleshooting
* [Linux troubleshooting tools](https://syedali.net/2013/08/20/linux-troubleshooting-tools)
* [Linux Performance Analysis in 60,000 Milliseconds](https://medium.com/netflix-techblog/linux-performance-analysis-in-60-000-milliseconds-accc10403c55)
## Networking
2019-01-23 20:35:14 +03:00
* [Network protocols for anyone who knows a programming language](https://www.destroyallsoftware.com/compendium/network-protocols?share_key=97d3ba4c24d21147)
2019-01-28 16:36:09 +03:00
* [Introduction to Linux interfaces for virtual networking](https://developers.redhat.com/blog/2018/10/22/introduction-to-linux-interfaces-for-virtual-networking)
2019-01-29 02:42:51 +03:00
* [Multi-tier load-balancing with Linux](https://vincent.bernat.ch/en/blog/2018-multi-tier-loadbalancer)
2019-01-20 16:08:28 +03:00
* [Introduction to modern network load balancing and proxying](https://blog.envoyproxy.io/introduction-to-modern-network-load-balancing-and-proxying-a57f6ff80236)
* [Load Balancing Algorithms](https://syedali.net/2013/08/22/load-balancing-algorithms)
2019-01-21 04:37:31 +03:00
## Containers
* [Introduction to Docker and Containers](http://container.training/intro-selfpaced.yml.html)
2019-01-27 07:57:56 +03:00
* [Containers Patterns](https://l0rd.github.io/containerspatterns)
* [Docker Container Anti Patterns](https://blog.couchbase.com/docker-container-anti-patterns/)
2019-01-27 07:57:56 +03:00
2019-04-07 18:02:06 +03:00
## Kubernetes
* [Deploying and Scaling Microservices with Docker and Kubernetes](http://container.training/kube-selfpaced.yml.html)
* [What happens when ... Kubernetes edition!](https://github.com/jamiehannaford/what-happens-when-k8s/blob/master/README.md)
* [Kubernetes Production Patterns](https://github.com/gravitational/workshop/blob/master/k8sprod.md)
2019-11-15 16:12:49 +03:00
* [Kubernetes production best practices](https://learnk8s.io/production-best-practices)
2020-07-26 09:12:25 +03:00
* [A Guide to the Kubernetes Networking Model](https://sookocheff.com/post/kubernetes/understanding-kubernetes-networking-model)
2019-04-07 18:02:06 +03:00
2019-01-27 09:45:33 +03:00
## Infrastructure as code / Configuration management
* [Terraform](https://learn.hashicorp.com/terraform)
* [Ansible](https://github.com/leucos/ansible-tuto)
2019-01-27 07:57:56 +03:00
## CI/CD
2020-07-19 19:20:04 +03:00
* [7 Pipeline Design Patterns for Continuous Delivery](https://www.singlestoneconsulting.com/blog/7-pipeline-design-patterns-for-continuous-delivery)
2020-03-18 11:59:34 +03:00
* [CI/CD patterns](https://continuousdelivery.com/implementing/patterns)
2019-06-29 13:36:14 +03:00
* [Six Strategies for Application Deployment](https://thenewstack.io/deployment-strategies)
2019-01-21 04:37:31 +03:00
2019-06-30 04:26:59 +03:00
## Clouds
* [The Open Guide to Amazon Web Services](https://github.com/open-guides/og-aws)
2020-10-30 18:40:37 +03:00
* [Learning Azure](https://docs.microsoft.com/en-us/learn/azure/)
* [Hands-On Training with GCP](https://cloud.google.com/training/badges)
2019-06-30 04:26:59 +03:00
2019-01-20 16:08:28 +03:00
## Programming
2020-10-30 18:40:37 +03:00
### Python
* [Python Basics](https://pythonbasics.org/)
* [Python For Everyone](https://www.py4e.com/)
2019-01-20 16:08:28 +03:00
### Go (Golang)
* [A tour of Go](https://tour.golang.org)
* [Go by Example](https://gobyexample.com)
2020-10-20 18:18:38 +03:00
* [Learn Go with Tests](https://quii.gitbook.io/learn-go-with-tests/)
2019-01-20 16:08:28 +03:00
* [Getting up and running with Go](http://www.golangprograms.com)
* [Effective Go](https://golang.org/doc/effective_go.html)
2019-01-25 20:13:26 +03:00
* [Go Design Patterns](https://github.com/tmrts/go-patterns)
2019-01-20 16:08:28 +03:00
* [Go Memory Management](https://povilasv.me/go-memory-management)
### Big O Notation, Algorithms and Data Structures
* [AlgoExperts](https://www.algoexpert.io)
* [Hacking a Google Interview Handout 1](http://courses.csail.mit.edu/iap/interview/Hacking_a_Google_Interview_Handout_1.pdf)
* [Hacking a Google Interview Handout 2](http://courses.csail.mit.edu/iap/interview/Hacking_a_Google_Interview_Handout_2.pdf)
* [Hacking a Google Interview Handout 3](http://courses.csail.mit.edu/iap/interview/Hacking_a_Google_Interview_Handout_3.pdf)
## System design
2020-02-14 15:06:48 +03:00
* [SystemsExpert course from AlgoExpert](https://www.algoexpert.io/se/product)
* [Grokking the System Design Interview](https://www.educative.io/collection/5668639101419520/5649050225344512)
2019-01-20 16:08:28 +03:00
* [The System Design Primer](https://github.com/donnemartin/system-design-primer)
* [Crack the System Design Interview](https://www.puncsky.com/blog/2016/02/14/crack-the-system-design-interview)
* [System design interview for IT companies](https://github.com/checkcheckzz/system-design-interview)
* [Web Architecture 101](https://engineering.videoblocks.com/web-architecture-101-a3224e126947)
* [What's in a Production Web Application?](https://stephenmann.io/post/whats-in-a-production-web-application)
## Monitoring
* [SLOs & You: A Guide To Service Level Objectives](https://www.circonus.com/2018/07/a-guide-to-service-level-objectives)
2019-02-03 19:45:14 +03:00
## Processes
* [Incident Response](https://response.pagerduty.com)
* [Postmortems](https://postmortems.pagerduty.com)
2019-11-17 21:26:26 +03:00
* [Runbooks](https://www.transposit.com/blog/2019.11.14-what-makes-a-good-runbook)
2020-02-02 01:28:02 +03:00
* [Identifying and tracking toil using SRE principles](https://cloud.google.com/blog/products/management-tools/identifying-and-tracking-toil-using-sre-principles)
* [Building SRE from Scratch](https://medium.com/ibm-garage/building-sre-from-scratch-485e23985bbd)
2019-02-03 19:45:14 +03:00
2019-01-20 16:08:28 +03:00
## Interview
### SRE interview process
* [How to hire talent](https://syedali.net/2014/04/01/how-to-hire-talent)
2019-03-14 04:55:06 +03:00
* [Recruitment process for a Google job (SRE, Site Reliability Engineer)](http://lambda-startup.com/recruitment-process-for-a-google-job-sre-site-reliability-engineer)
2019-01-20 16:08:28 +03:00
### Interview Questions
* [A collection of questions to practice with for SRE interviews](https://github.com/michael-kehoe/sre-interview)
* [SRE Interview Questions](https://syedali.net/engineer-interview-questions)
* [Sysadmin Test Questions](https://github.com/trimstray/test-your-sysadmin-skills)
2019-02-25 18:42:48 +03:00
* [Kubernetes job interview questions](https://enterprisersproject.com/article/2019/2/kubernetes-job-interview-questions-how-prepare)
2019-11-05 05:54:23 +03:00
* [DevOps Guide](https://github.com/Tikam02/DevOps-Guide)
2020-01-06 21:50:42 +03:00
* [Questions I ask in SRE interviews](https://dev.to/logan/questions-i-ask-in-sre-interviews-a9j)
2020-02-10 23:43:31 +03:00
* [DevOps Roadmap: Learn to become a DevOps Engineer or SRE](https://roadmap.sh/devops)
2019-01-20 16:08:28 +03:00
### Blogposts
* [SRE Interviews in Silicon Valley](http://blog.marc-seeger.de/2015/05/01/sre-interviews-in-silicon-valley)
* [Preparing the SRE interview](https://blog.balthazar-rouberol.com/preparing-the-sre-interview)
2019-04-10 14:19:42 +03:00
* [How to Get Into SRE](https://blog.alicegoldfuss.com/how-to-get-into-sre)
2019-11-10 13:55:03 +03:00
* [My Job Interview at Google](https://catonmat.net/my-job-interview-at-google)
2019-11-18 03:47:18 +03:00
* [Path to Site Reliability Management](https://danrl.com/blog/2019/path-to-srm)
2020-03-06 19:17:33 +03:00
* [Becoming a Site Reliability Engineer](https://tik.dev/becoming-an-sre)
2019-01-20 16:08:28 +03:00
## Books
### SRE books
* [Site Reliability Engineering](https://landing.google.com/sre/sre-book/toc/index.html)
* [The Site Reliability Workbook](https://landing.google.com/sre/workbook/toc/)
2019-01-20 16:08:28 +03:00
* [Seeking SRE](https://books.google.ru/books?id=tmhqDwAAQBAJ)
* [Building Secure and Reliable Systems](https://static.googleusercontent.com/media/landing.google.com/ru//sre/static/pdf/Building_Secure_and_Reliable_Systems.pdf)
* [Implementing Service Level Objectives](https://learning.oreilly.com/library/view/implementing-service-level/9781492076803)
2019-01-20 16:08:28 +03:00
### Linux
* [Linux Kernel Development (3rd Edition)](https://www.amazon.com/Linux-Kernel-Development-Robert-Love/dp/0672329468)
* [UNIX and Linux System Administration Handbook (5th Edition)](https://www.amazon.com/UNIX-Linux-System-Administration-Handbook/dp/0134277554)
* [Linux Pocket Guide, 3rd Edition](http://shop.oreilly.com/product/0636920040927.do)
2019-01-23 20:35:14 +03:00
### Networking
* [TCP/IP Illustrated, Volume 1](https://www.amazon.com/TCP-Illustrated-Protocols-Addison-Wesley-Professional/dp/0321336313)
### Troubleshooting and Performance
* [Systems Performance: Enterprise and the Cloud](https://www.amazon.com/Systems-Performance-Enterprise-Brendan-Gregg/dp/0133390098)
2019-01-20 16:08:28 +03:00
## Courses
2019-01-20 16:10:13 +03:00
* [Site Reliability Engineering: Measuring and Managing Reliability](https://www.coursera.org/learn/site-reliability-engineering-slos)