An account of how a developer, Alexey Grigorev, accidentally deleted 2.5 years of data from his AI Shipping Labs and DataTalks.Club websites using Claude Code and Terraform. Grigorev intended to migrate his website to AWS, but a missing state file and subsequent actions by Claude Code led to a complete wipe of the production setup, including the database and snapshots. The data was ultimately restored with help from Amazon Business support. The article highlights the importance of backups, careful permissions management, and manual review of potentially destructive actions performed by AI agents.
Amazon Web Services (AWS) recently made a significant move by laying off approximately 40% of its DevOps staff. This decision wasn't a sign of downsizing, but rather a strategic shift towards automation and a new tool called 'Dahlia'. This article explores the reasons behind the layoffs, the capabilities of Dahlia, and its potential impact on the future of DevOps.
The article details Amazon Web Services' (AWS) recent decision to lay off a significant portion (around 40%) of its DevOps workforce, specifically those involved in managing and maintaining its own internal infrastructure. This isn't a sign of AWS abandoning DevOps, but rather a strategic shift *towards* fully embracing a "platform engineering" approach and leveraging automation tools.
* **Shift to Platform Engineering:** AWS is building internal "developer platforms" – self-service tools and standardized components – to empower application development teams to manage their own infrastructure and deployments with less reliance on centralized DevOps teams.
* **Key Tools Driving the Change:** The article highlights three main tools enabling this transition:
* **Pulumi:** An Infrastructure-as-Code (IaC) tool allowing developers to define infrastructure using familiar programming languages (Python, JavaScript, Go, etc.).
* **Crossplane:** An open-source Kubernetes add-on that extends Kubernetes to manage infrastructure across multiple cloud providers.
* **Backstage:** A developer portal created by Spotify, now open-source, that provides a centralized interface for developers to discover, create, and manage software components and infrastructure.
* **Impact of the Layoffs:** The layoffs were concentrated in teams traditionally responsible for manual infrastructure provisioning and maintenance. The remaining DevOps staff are being re-focused on building and maintaining the internal developer platforms.
* **Wider Industry Trend:** This move by AWS reflects a broader trend in the industry towards platform engineering, driven by the need for faster innovation, increased developer productivity, and reduced operational overhead.
In essence, AWS is automating away much of the traditional DevOps work, allowing developers to self-serve their infrastructure needs through these platform tools. This is a strategic move to scale its internal development efforts and accelerate innovation.
Late last year, startup Platform Engineering Labs made waves in the world of Infrastructure as Code (IaC) by introducing a new IaC platform, called Formae, available initially on Amazon Web Services. This week, Platform Engineering Labs‘ platform gets (beta) support from additional cloud platforms, including Google Cloud Platform, Microsoft Azure, Oracle Cloud Infrastructure, and OVHcloud. The company has also released new AI-enhanced software for managing infrastructure tooling, called the Platform for Infrastructure Builders.
Platform Engineering Labs has released formae, an open-source infrastructure-as-code platform designed to address limitations in existing tools, focusing on automatic discovery, codification of existing infrastructure, and a reconcile/patch workflow. It uses PKL instead of HCL and targets reducing drift and complexity.
Plural is bringing AI into the DevOps lifecycle with a new release that leverages a unified GitOps platform as a RAG engine. This provides AI-powered troubleshooting, natural language infrastructure querying, autonomous upgrade assistance, and agentic workflows for infrastructure modification, all with enterprise-grade guardrails.
The article discusses 5 self-hosted services (NetBox, Harvester, Terraform, Ansible, and Kubernetes) that the author finds useful in their home lab despite being potentially overkill for typical setups. It details the benefits and use cases for each service, highlighting how they enhance the author's tinkering and learning experience.
Learn how to automate your home lab using Terraform, Ansible, Packer & GitLab CI/CD for consistent, version-controlled, and scalable infrastructure.
While current large language models (LLMs) can generate syntactically correct Terraform HCL code, they often miss critical elements like permissions, event triggers, and best practices. Iterative refinement with developer input is necessary to produce deployable, functional stacks. The article suggests using tools like Nitric to provide application context and enforce security, dependencies, and best practices.
This article provides a cheatsheet on the Infrastructure as Code (IaC) landscape, highlighting the benefits of scalable infrastructure provisioning in terms of availability, scalability, repeatability, and cost-effectiveness. It discusses strategies such as containerization, container orchestration, and tools like Terraform, Kubernetes, and Ansible. The article also introduces GitOps as a method for automating infrastructure updates through Git workflows and CI/CD.
An introduction to using Terraform for Infrastructure as Code (IaC) practices, providing beginners with a guide to start with Terraform.