Storage usage reports for administrators¶
If you are an administrator of a Sunet Drive node, you will get access to a shared folder from Sunet, containing a daily statistics of how many users are provisioned on your node, and how much storage you use. You will see two files that are updated every night, while the last files are moved to the subfolder “daily”:
-latest.csv\ Contains a timestamp, as well as two entries: Total GB and Total users. -latest-detailed.csv\ Contains a detailed description of all storage components of your node
Here is what you can expect to see in the
- Primary storage\ Contains an overview of everything that lies outside a projectbucket or the userbuckets.
- Database\ All of the metadata from Nextcloud resides in the database, as well as some binary files, including file versions and deleted files
- Projectbuckets and userbuckets\ An overview of the buckets that have been assigned to research project
- Backup/Mirror\ An overview of what we have mirrored to the other datacenter with a minimum data retention time of two months; your institution might have a longer retention time
If you assign a bucket to a research project, all files stored via the sync client will reside in this bucket, in a regular file and folder structure. If a user now creates a new version of a file, or deletes the file, there will be additional data stored in the primary storage, as well as the database. Backups are created during the night, which will then also determine the latest version that can be found in the backup of a project bucket.
This schema creates a bit of an overhead of data that is stored outside of project buckets. Depending on how many versions of a file exist, and depending on the data retention time of your backups, around 15-30% of overhead is created.
The total amount of storage for a research project therefore is:
- The amount of data stored in sto4-XXX
- Plus the amount of data stored in sto3-XXX (backup/mirror)
- Plus the above mentioned overhead of around 15-30%