Last modified January 21, 2025
Prepare your AWS account
This guide provides the necessary steps to prepare your AWS accounts to run our platform, including Cluster API Provider for AWS (CAPA).
Requirements
You can run the management and workload clusters in the same account or separate accounts in AWS environments. To help you take the decision, please read our multi-account article, where we explain the pros and cons of both approaches. Most requirements are related to configuring Identity and Access Management (IAM) roles and service quotas.
To perform the following steps, you must access the AWS console or have AWS CLI installed and pointed to the right account.
Step 1: Service quotas
AWS establishes default quotas for all your cloud services as described in the provider documentation. The following overview lists the quotas you must adjust to use the account to operate Giant Swarm clusters.
Adjusting a service quota requires issuing a support case in the AWS Support Center, where you will find a specific entry form for each type of case. Each quota type requires a separate case. When creating these, log in to the proper account and select the correct region. If you plan to deploy clusters to multiple regions, ensure the quotas are raised for each region.
Below is a screenshot of a service quota entry form.
Please request an increase in the following quotas (grouped by type):
- VPC
- VPCs per region: 50
- NAT Gateway per Availability Zone per region: 50 (not needed if you are creating a private cluster)
- IPv4 CIDR blocks per VPC: 50
- Routes per route table: 200
- Route 53 Resolver
- Endpoints per AWS region: 100 (needed if you are creating a private cluster)
- Elastic IP
- New VPC Elastic IP Address Limit per region: 50 (not needed if you are creating a private cluster)
- Elastic Load Balancers
- Application and Classic Load Balancers per region: 100
- Auto Scaling
- Auto Scaling Groups per region: 250
- Launch Configurations per region: 500
- S3
- Buckets per Account: 1000
- EC2 Spot Instances
- Set the limit according to your needs for every primary instance type you tend to use spot instances with.
- EC2 Instances
- m4.xlarge per region: 250
- m4.2xlarge per region: 250
- m5.2xlarge per region: 250
- other instance types to be used as workers: increase accordingly
Note: Please extend the list of EC2 instances to contain the frequently needed types.
You will be asked to describe your use case when requesting a service quota increase. Use this text for the following purposes:
We intend to run multiple Kubernetes clusters in this account, potentially used
by various globally distributed teams. We will be creating and deleting new
clusters frequently.
Every cluster needs its own VPC for security/isolation reasons and its own
Elastic IP address for the NAT gateway.
Every cluster has at least 1 Auto Scaling Group, but can contain multiple ASGs if
multiple instance types are requested as cluster nodes. If we count 50
clusters with up to 5 EC2 instances each, as worker nodes, we need up to 250
ASGs. To update the ASGs in a rolling manner, we need to duplicate the ASGs
for a short time during the update, hence the 500 Launch Configurations.
The number of EC2 instances used as worker nodes is supposed to be scaled
dynamically based on traffic, hence the high numbers of EC2 instances requested.
Step 2: Permissions
Two types of IAM roles need to be created in the AWS account.
First, the CAPA controller in the management cluster uses the controller role to provision all infrastructure for managing workload clusters.
Second, the staff role is used by Giant Swarm engineers to access the AWS account for support purposes.
Here are the mandatory steps to create the required IAM resources, but we advocate following one of the automated options. There, you have Terraform or CloudFormation templates to create the necessary IAM roles and policies. Otherwise, follow the manual steps below.
Controller permissions
1. Create all policies for the Cluster API controller
Open the IAM section of the AWS console and go to the Policies subsection.
Our platform needs different permissions to manage various resources on the AWS side. Here is the list of policies that need to be created. Click Create policy and copy the policy content into the JSON editor field. Hit the Next
button and add the policy name matching the policy link name. Repeat this process for each policy:
- giantswarm-${INSTALLATION_NAME}-capa-controller-policy
- giantswarm-${INSTALLATION_NAME}-crossplane-policy
- giantswarm-${INSTALLATION_NAME}-dns-controller-policy
- giantswarm-${INSTALLATION_NAME}-eks-controller-policy
- giantswarm-${INSTALLATION_NAME}-iam-controller-policy
- giantswarm-${INSTALLATION_NAME}-irsa-operator-policy
- giantswarm-${INSTALLATION_NAME}-mc-bootstrap-policy
- giantswarm-${INSTALLATION_NAME}-network-topology-operator-policy
- giantswarm-${INSTALLATION_NAME}-resolver-rules-operator-policy
Warning: When filling out the policy name, remember to replace the INSTALLATION_NAME
placeholder with the name of your installation.
Note: All policy names contain the installation name to make it easier to identify the policies and avoid conflicts with other installations running within the same account.
2. Create the role metadata
Now move to the IAM Roles subsection, and hit the Create role button. In the following screen, when asked to Select the type of trusted entity, choose AWS account.
The Account ID you need to enter is the AWS account where the management cluster runs. In case you run all your clusters in a single account, there is no need to change it, but if your management cluster runs in a separate account, you need to use that one for Another AWS account
option.
It’s crucial that the Require external ID and Require MFA options remain unchecked so that CAPA can run without requiring human intervention.
3. Attach all policies to role
Now, you must go through all the policies created before and attach them to the role. You can just search for the installation name, select the policies and hit the Next button.
4. Name the role
The last step of role creation requires you to set a name for the role. As with the policies, please use the name changing the INSTALLATION_NAME
placeholder to the name of your management cluster.
giantswarm-${INSTALLATION_NAME}-capa-controller
Staff permissions
Finally, we create an IAM role for Giant Swarm support staff to assume access to both the management cluster’s and workload clusters’ AWS accounts. This role must have Giant Swarm’s account as a trusted entity, and we recommend that it enforce multi-factor authentication.
1. Create the admin policy
Go to the Policies subsection and select Create policy to set the admin permissions. Use the admin JSON policy file as the policy content. This time, call the policy.
GiantSwarmAdmin
2. Create the admin role
Enter again in the Roles subsection of the AWS console and select Create role When asked to Select type of trusted entity choose AWS account.
- In Account ID enter the value
084190472784
. - Don’t enable Require external ID.
- It’s strongly recommended to check the option Require MFA (multi factor authentication). This adds an extra authentication step for users to assume the role, which increases security.
3. Attach policy to role
Attach the newly created GiantSwarmAdmin
policy to the role you are creating.
4. Name the role
Name this role:
GiantSwarmAdmin
Giant Swarm staff require access to all AWS resources, so please repeat the above steps for every account where you will run clusters.
Step 3: Configure the cluster role identity
The last step involves storing the AWS credentials in the platform to allow the CAPA controller to manage your account’s infrastructure. In Cluster API there is a custom resource called AWSClusterRoleIdentity
that stores the AWS role ARN and the role’s name. Take into account that this resource is namespaced.
By default, the controller uses a default
configuration, which points to the role in the management cluster account. If you want to create a new workload cluster in a new AWS account, you need to create a new AWSClusterRoleIdentity
resource that references the role of that AWS account. Example:
apiVersion: infrastructure.cluster.x-k8s.io/v1beta1
kind: AWSClusterRoleIdentity
metadata:
labels:
cluster.x-k8s.io/watch-filter: capi
name: <ACCOUNT_NAME>
spec:
allowedNamespaces:
list: null
selector: {}
roleARN: arn:aws:iam::${ACCOUNT_ID}:role/giantswarm-golem-capa-controller
sourceIdentityRef:
kind: AWSClusterControllerIdentity
name: default
The <ACCOUNT_NAME>
is a short unique name referencing the AWS account (development
, sandbox
or staging2
). We advocate using the same name as the organization to help map the resources and the accounts. The <ACCOUNT_ID>
is the AWS account ID where the role is created and where the workload cluster will be provisioned.
Note: The official documentation provides more information about configuring AWS credentials.
In the next step you define which role the AWSCluster
uses to provision the cluster adjusting the value aws.awsClusterRoleIdentityName
.
Note: In case you are working with a Giant Swarm partner, you might not have access to the platform API. In that case, please provide the role ARNs values, CAPA controller and staff to your partner contact.
Warning: If you AWS account is a China account, make sure to follow the ICP Filing process.
Next steps
Contact your Giant Swarm account engineer to verify the setup and proceed with the provisioning of the management cluster. For sharing any secret with us please read this article first. In case you have already set up the management cluster and you have just configured a new AWS account, you can proceed with the creation of the workload cluster.
Need help, got feedback?
We listen to your Slack support channel. You can also reach us at support@giantswarm.io. And of course, we welcome your pull requests!