Can a system ever truly be considered reliable if it isn't fundamentally secure? In two previous O'Reilly books, experts from Google showed how reliability is fundamental to service design. Site Reliability Engineering and The Site Reliability Workbook demonstrated how and why a commitment to the entire service lifecycle enables organizations to successfully build, deploy, monitor, and maintain software systems.
Security is also crucial to the design and operation of scalable systems in production, as it plays an important part in product quality, performance, reliability, and availability. In a world where most products are connected to the internet, and with cloud technologies and machine learning becoming more prevalent, enabling security by default is increasingly important.
This book shares best practices to help an organization of any size design scalable and reliable systems that are fundamentally secure. It also offers insights into how teams across an organization can collaborate on security and reliability.
Specifically, you'll learn about:
Implementation & Operations
Preparing for the Worst: Detection & Response
Scaling Security and the Organization
Heather Adkins is a 17-year Google veteran and founding member of the Google Security Team. As Sr Director of Information Security, she has built a global team responsible for maintaining the safety and security of Google’s networks, systems and applications. She has an extensive background in systems and network administration with an emphasis on practical security, and has worked to build and secure some of the world’s largest infrastructure. She now focuses her time primarily on the defense of Google’s computing infrastructure and working with industry to tackle some of the greatest security challenges.
Betsy Beyer is a Technical Writer for Google Site Reliability Engineering in NYC, and the editor of Site Reliability Engineering: How Google Runs Production Systems and The Site Reliability Workbook. She has previously written documentation for Google's Data Center and Hardware Operations Teams in Mountain View and across its globally-distributed data centers. Before moving to New York, Betsy was a lecturer on technical writing at Stanford University. En route to her current career, Betsy studied International Relations and English Literature, and holds degrees from Stanford and Tulane.
Paul Blankinship manages the Technical Writing team for Google’s Security and Privacy Engineering group. He’s previously written documentation for Google Web Designer, and helped develop Google’s internal security and privacy policies.
Piotr Lewandowski is a Staff Site Reliability Engineer, responsible for the security of Google’s Production infrastructure and ensuring harmonious collaboration between the SRE and the Security organizations. He is also one of the responders for large-scale incidents. In his previous role he led a team responsible for the reliability of Google’s critical security infrastructure. Prior to joining Google 8 years ago, he worked at CERT Polska, owned a software company and graduated from Warsaw University of Technology with a degree in Computer Science.
Ana Oprea specializes in Site Reliability Engineering, Security, and planning and strategy for Google’s Technical Infrastructure – a role that follows naturally from her previous experience as a Software Developer, Technical Consultant, and Network Admin. After working and studying in Germany, France, and Romania, she accounts for different cultural approaches when facing any challenge.
Adam Stubblefield is a Distinguished Engineer and the Horizontal Lead for Security at Google. Over the past 8 years, he’s led teams that have built much of Google’s core security infrastructure. Adam has a PhD in Computer Science from Johns Hopkins.