We’re excited to announce the next Meetup of the Site Reliability Engineering Munich group for June 13th.
This time around we will be meeting at Netlight as they have graciously agreed to host us! Lets all meet up again, talk about reliability, exchange ideas and see where we can continue to learn on our journey as site reliability engineers (and folks that aspire to be one!). Meetups are about engaging within the community, so we are looking to everyone to share ideas and learn to ultimately to reduce the risk of disasters.
We would like to thank Netlight Consulting for sponsoring this event and catering it!
Agenda#
- 6:30 pm Get together with food and drinks
- 7:00 pm Welcome to SREmuc
- 7:10 pm Talk 1: Exercising Effective Incident Response and Disaster Recovery Plan via Gamedays (Thanos Amoutzias, VW Elli)
- 7:30 pm Talk 2: Building Resilient Event-Driven Systems with GCP Pub/Sub: Key Reliability Considerations (Saadi Myftija, Netlight)
- 8:00 pm Short break
- 8:15 pm Talk 3: Eliminating human error using self-service GitOps with Crossplane (Stéphane Di Cesare and Christopher Haar, DKB)
- 8:45 pm Networking + Drinks
- 9:30 pm Leave happy and inspired :)
Speakers#
- Thanos Amoutzias is a Software Engineer and SRE Lead at VW Elli. He is passionate about building reliable services and delivering impactful products. You can find him on LinkedIn and in the mountains.
- Saadi Myftija is a consultant at Netlight, focusing on backend and cloud engineering. He’s currently working as tech lead in the EV charging platform team at VW Elli.
- Stéphane Di Cesare is a Senior Platform Engineer at DKB, where he is focusing on improving the developer experience and the developer acceptance of the internal platform. He also has a consulting background.
- Christopher Haar is Platform Tech Lead at DKB, where he is responsible of determining the technologies used by the internal platform. He is also one of the maintainers of the Crossplane open source project.
Abstracts#
Talk 1#
How can I start practicing Gamedays at my company? In this talk we are going to dive deeper into the organization of the event, from identifying incidents to run, execution and logistics to disaster recovery. Lastly, we will have a look at results and feedback we have received.
Slides /slides/2023_06/gamedays.pdf
Talk 2#
Event-driven architecture (EDA) is a common pattern in building modern service-oriented applications. It helps decoupling system components, which enables scaling, updating and deploying them independently. However, EDA comes with its own set of challenges and trade-offs. In this presentation we’ll talk about reliability considerations around GCP Pub/Sub, our event broker of choice to implement EDA. We’ll mainly focus on publisher reliability and how to monitor it, dead-letter queues and message retrying.
Talk 3#
In this talk, we are going to present what the typical challenges of a Platform team are. We will highlight the importance of self-service GitOps in a banking environment, and will explain how these principles are implemented at DKB, using the open source projects Crossplane and Flux.
Photos#
Participation#
We’re always looking for 20-45 minute (technical) talks related to the very broad field of Site Reliability Engineering. Get in touch with the organizers if you’d like to present!
Slides#
Will be added after the event
Legal#
There may be audio and video recordings of the talks and we may take photographs during the event with the purpose of sharing the learnings and advertising future events. By attending the event you give your consent to be recorded. The “Tales from On-call” sessions are never recorded and the Chatham House Rule apply: https://en.wikipedia.org/wiki/Chatham_House_Rule
Spread the word! Feel free to refer to this Meetup on social media using the #sremuc hashtag!