Paper: [IMC'21] From Cloud to Edge: A First Look at Public Edge Platforms
Dataset list
- NEP-small: The workloads traces of 14 edge sites of China's largest public edge platform during June 2020 (5-min per record).
- NEP-large: The data of 139 edge sites of China's largest public edge platform during June 2020 (5-min per record). (In this data set, we additionally provide three-month continuous records for bandwidth trace, but the granularity is 1 hour.)
Each dataset contains the workloads traces (CPU, memory, storage, bandwidth) at virtual machine (VM) granularity from a large-scale edge cloud platform in China.
Specifically, each dataset consists of five CSV files.
- VM_CPU.csv: the CPU usage data of each VM;
- VM_BANDWIDTH.csv: the bandwidth (both uplink and downlink) usage data of each VM;
- SITE_RTT.csv: the RTT among edge sites;
- VM.csv: the VM affiliation table, including customer id, mirror id, specifications, etc;
- PM.csv: physical machine(PM) ID, the CPU, memory, and storage size of each physical machine.
(1)
vm_idis the VM ID; (2)site_idis the edge site ID; (3)cpu_rateis the CPU usage. For example, 0.6 indicates 60%; (4)report_tsis the Unix timestamp of data collection.
pubis the public network,priis the internal network,upis the uplink,downindicates the downlink,bwis the bandwidth usage (bps), andflowis the number of flows
uidis the customer ID;pm_nameis the physical machine (PM) ID where the VMs are hosted;statusis the status of instance;image_idis the system image ID used by the instance;os_typeis the OS type;os_nameis the OS name.
cores,memory, andstorageare the number of CPU cores, memory size (MB), and disk size (MB) of the physical machine, respectively.
- The data can be only used for research purpose.
- The data can not be shared offline.
To request the data, please submit a form here: https://forms.gle/j3QDp9qtCVyrcTwm9, and we will response AFAP. Contact Mengwei Xu (mwx@bupt.edu.cn) for any inquiry.




