EOSC Cloud Validation Test Suite

This tool is intended to be used to test and validate commercial cloud services across the stack for research and education environments and it is being used as a validation tool for commercial cloud services procurement in European Commission sponsored projects such as OCRE, ARCHIVER and CloudBank EU. For more information about the European Open Science Cloud (EOSC) visits its website here.

Please find here the repository hosting the source code of this tool.


The test-suite executes four main steps:

  1. Infrastructure provisioning: VMs are created using Terraform and then with these several Kubernetes clusters are bootstrapped by Ansible according to the selected tests.
  2. Deploy the tests: Kubernetes resource definition files (YAML) are used to deploy the tests.
  3. Harvest results: at the end of each test run a result file -written in JSON- is created. This file is collected from the pod running on the cluster.
  4. Optionally, destroy resources.

The test set described in the Tests Catalog section of the documentation is originally based on the tests used in the Helix Nebula The Science Cloud PCP project funded by the European Commission.

The developers would like to thank all test owners and contributors to this project.

The latest version of the suite has been tested on:

OS running on provider’s VMs CentOS7, Ubuntu 18.04, Ubuntu 20.04
Providers / clouds
AWS
Google Cloud
Microsoft Azure
Oracle Cloud Infrastructure
Exoscale (CloudStack)
T-Systems’ Open Telekom Cloud (OpenStack)
CERN Private Cloud (OpenStack)
CloudFerro (OpenStack)
CloudStack
OpenStack
Ionos
CloudSigma
OVH
IBM
CityNetwork
X-Ion
Layershift
Orange’s Flexible Engine
Yandex Cloud

The suite is continuously tested in new cloud providers. As tests are concluded, the cloud providers names will be added to the table above.

Release notes

(Note the versions are numbered with the date of the release: YEAR.MONTH)

21.12 - latest

  • Included deployment support for Ionos, CloudSigma, OVH, IBM, CityNetwork, X-Ion, Layershift, Orange and Yandex.
  • Updated CPU Benchmark: using new Hepscore based implementation.
  • Enabled prompt asking for confirmation prior to starting the run in case of existing Terraform files.
  • Network test (perfSONAR) update: added reverse throughput measurement & retries.
  • Complete networking/VPC provisioning: AWS, Openstack, IBM, Orange, OCI, Ionos and Yandex.
  • Added option –freeMaster to disable running tests/benchmarks on the master node.
  • Updated results upload to CERN’s private Object Storage.
  • Implemented a dashboard to visually display results. While the actual deployment is not public, its source code can be found here.

21.4

  • Cluster certificate additionally signed for NAT IP (no need to use bastion method, with this solution the cluster can be reached from outside of the provider network. However, previous allocation of floating IPs is now required).
  • Added –usePrivateIPs option for bastion’s method.
  • Removed CloudStack Terraform support (the provider’s repository was archived by HashiCorp).
  • Allowed both project-wide and VM-specific ssh key on GCP.
  • Improved configuration: select network.
  • Updated Distributed GAN test: included NNLO implementation ; more configuration (dataset size).
  • Added ProGAN test.
  • Allowed subset of costs (general configuration YAML file) and tests (tests catalog YAML file).
  • Allowed relative paths for -c and -t.
  • Updated CPU benchmark, based on the HEP Benchmarking Suite.
  • Added option –noWatch to run without displaying logs, without watch command.

20.6

  • Improved support for running on Oracle Cloud Infrastructure and T-Systems’ OTC.
  • Added option –customNodes to set the number of instances that should be deployed for the shared cluster.
  • Using Terraform’s yamldecode with configs.yaml for variables instead of Python’s replace function with placeholders.
  • Disabled general Terraform support: only the providers and clouds that support Terraform and are present on the table above are fully supported by this suite. To run on another provider (supporting Terraform or not), the option ‘–noTerraform’ has to be used.

20.2

  • Using Ansible for VM configuration instead of Terraform’s provisioners.
  • Added support for non-Terraform providers (only bootstrap phase).
  • Added options to destroy provisioned infrastructure.
  • Added options to specify custom paths to configs.yaml and testsCatalog.yaml.
  • Added support to use Ubuntu on VMs.

19.12

  • Project restructured.
  • Improved support for running on Google, AWS, Azure, Exoscale, OpenStack and CloudStack.

19.8

  • Parallel creation of clusters, with different flavors according to tests needs.
  • New logging system to keep parallel running tests logs sorted.
  • Restructured configuration: moved configuration files to /configurations and created new files taking HCL code (terraform configuration code) to keep configs.yaml clean.
  • Automated allowance of root ssh by copying open user’s authorized_keys to root’s ~/.ssh as well as sshd_config modification.
  • Usage of Kubernetes API instead of Kubernetes CLI.
  • For network test (perfSONAR), usage of API instead of pscheduler CLI.
  • New test: Dynamic On Demand Analysis Service, provided by INFN.
  • Added configurations validation with jsonschema.
  • Created Docker image to run a Test-Suite launcher container: rapidly creates a ready to use Test-Suite launcher.

19.4

  • New tests: network performance (perfSONAR) and CPU benchmarking.

19.2

  • First release.

Contact

For more information contact ignacio.peluaga.lozada AT cern.ch

License

Copyright (C) CERN.

You should have received a copy of the GNU Affero General Public License along with this program. If not, see gnu.org/licenses.