Traces from existing parallel and distributed computing systems are a useful resource for researchers to replicate real-life workloads for their experiments. However, since cloud computing is a relatively new area few such traces are currently available.

Using Chameleon Cloud Traces

We have developed a trace data structure based on data from OpenStack Nova/Blazar/Ironic services, as well as software to extract the appropriate data. We are making available data from the OpenStack cloud operated by the Chameleon testbed for educational projects.

We released Chameleon cloud traces to enable researchers to run their experiment and/or simulations with more realistic scientific testbed data. It would be a huge encouragement to us to see your works using our cloud traces. If you use our data in your research, it would be great to let us know. You can find our contact information here.

For a quick start on downloading and using our cloud traces, we provided a Jupyter Notebook example, which you can uploadĀ to Chameleon Jupyter server. For more information about Chameleon Jupyter interface, please visit Chameleon user documentation.

Sharing Your Cloud Traces

We also encourage research groups and cloud testbed providers to share their cloud traces with us. To share your cloud traces, please provide the following information with your cloud traces:

  • Cloud trace format and format version
  • How the cloud trace was generated (i.e. software tools)
  • Cloud trace period
  • Release date
  • Contact information
  • (Optional) Publications

If you are interested in sharing your cloud traces with us, please contact us!

Other Released Cloud Traces

The commercial cloud providers have released their cloud traces, and numerousĀ researches had been done based on these released traces. We listed Microsoft Azure, Alibaba and Google release below for your reference.