WestDRI: UBC RC - Formats
Date: 4 November 2022 @ 20:00 - 21:30
Timezone: UTC
Language of instruction: English
Which file format should you use when saving your research dataset? Besides the obvious question of how to encode your data structures in a file, you might also want to consider portability (the ability to write and read data across different operating systems, programming languages, and libraries), the inclusion of metadata (data description), I/O bandwidth and file sizes, compression, and the ability to read and write data in parallel for large datasets. In this in-person introductory workshop, we will cover all these aspects, starting from the most basic file formats and ending with scalable scientific dataset formats. We will cover CSV, JSON, YAML, XML, BSON, VTK, NetCDF, and HDF5, using both structured and unstructured datasets and demonstrating all examples in Python.
Keywords: Python, Programming