bootstrap shooting at the clouds

Posted on May 9, 2012 in blogSaaS

Written by Yoav Aner, April 9 2012

One of my primary aims when building a resillient cloud architecture, is being able to spawn instances quickly. Many cloud providers give you tools to create images or snapshots of existing cloud instances and launch them. This is great, but not particularly portable. If I have one instance on Linode and I want to clone it to Rackspace, I can’t easily do that.

That’s one of the reasons I am creating bootstrap scripts that completely automate a server (re)build process. Given an IP address and root password, the script should connect to the instance, install all necessary packages, pull the code from the repository, initialize the database, configure the web server and get the server ready for restore of user-data.

I’m primarily using fabric for automating this process, and use a standard operating system across different cloud providers. This allows a fairly consistent deployments across different providers. This also means the architecture is not dependent on a single provider, which in my opinion gives a huge benefit. Not only can my architecture run on different data centres or geographic locations, but I can also be flxeible in the choice of hosting providers.

All that aside however, building and refining this bootstrapping process allowed me to run it across different cloud providers, namely: Rackspace, Linode, and EC2. Whilst running the bootrstrapping process many times, I thought it might be a great opportunity to compare performance of those providers side-by-side. My bootstrap process runs the same commands in order, and covers quite a variety of operations. This should give an interesting indication on how each of the cloud providers performs.

Tested platforms

The tests were carried out using the default Debian 6 Squeeze on the lowest-end cloud instances on all three providers:

  • Rackspace 256Mb and 512Mb – using the London data centre.
  • Linode 512 – using the London data centre.
  • EC2 micro instance (EBS volume) – using the Ireland data centre.

Bootstrap process

The bootstrap process executes the following tasks:

  1. apt-get update && apt-get upgrade and installing a list of prerequisite packages
  2. Installing Postgresql from backports
  3. Downloading, compiling and installing ruby and sphinx from source
  4. Setting up SSH keys
  5. Pulling code from a remote git repository
  6. Creating a couple of small (empty) databases and user accounts
  7. Tweaking some configuration files
  8. Performing bundle install on a rails project
  9. Performing rake tasks to set the database schema and seed the database

These are relatively I/O intensive operations, but also involve CPU tasks (compiling code) and network access (downloading sources and packages), so should provide a reasonable benchmark for comparing the performance of those cloud providers.

Results

These highly-unscientific results are quite basic. No fancy charts or anything. All I measured was how long the entire bootstrap operation was taking on each of the cloud providers.

Rackspace 256: 1269 seconds (~21 minutes)
Rackspace 512: 1144 seconds (~19 minutes)
Linode 512: 1053 seconds (~17.5 minutes)
EC2 micro: 4090 seconds (1 hour and 8 minutes!!??)

Linode seems to be the winner, running around 20% faster than Rackspace 256 and 8% faster than rackspace 512. What’s much more surprising however (for me anyway), is how slow EC2 is in comparison, running 378% slower than Linode… I am guessing this is down to EBS storage. Quite a big performance hit for persistent storage though.