Datasets¶
A dataset in a Dataverse installation is a container for your data, documentation, code, and the metadata describing this Dataset.

A dataset contains three levels of metadata:
- Citation metadata: any metadata that would be needed for generating a data citation and other general metadata that could be applied to any dataset;
- Domain specific metadata: with specific support currently for Social Science, Life Science, Geospatial, and Astronomy datasets; and
- File-level metadata: varies depending on the type of data file - for more details see File management section below).
Create a dataset¶
-
Navigate to the Dataverse collection in which you want to add a dataset.
-
Click on the “Add Data” button and select “New Dataset” in the dropdown menu.

-
To quickly get started, enter at minimum all the required fields with an asterisk (e.g., the Dataset Title, Author Name, Description Text, Point of Contact Email, and Subject) to get a Data Citation with a permanent link.
-
Scroll down to the “Files” section and click on “Select Files to Add” to add all the relevant files to your Dataset. You can also upload your files directly from your Dropbox.
-
Click the “Save Dataset” button when you are done. Your unpublished dataset is now created.
-
Add additional metadata once you have completed the initial dataset creation by clicking the Edit button and selecting Metadata from the dropdown menu.
Edit a dataset¶
Once you’ve created a dataset, you can edit it at several levels to keep it up to date and well-organized. Here’s how you can make changes and what each option means for managing your dataset:

-
Files (Upload): You can add new files to your dataset, replace outdated ones, or remove files that are no longer needed. This helps ensure your dataset reflects the most current version of your research.
-
Metadata: If you need to update details like the title, description, or authors of your dataset, you can do so by editing the metadata. This allows you to keep your dataset accurate and up-to-date.
Moreover, once your dataset is created, editing it will also reveal additional metadata fields that weren’t visible when you first created the dataset. These extra fields allow you to provide more detailed information about your dataset. If you selected a specific domain when creating the Dataverse (e.g., Social Science or Astronomy), the new metadata fields shown during editing will include domain-specific options. Adding this extra metadata makes your dataset more complete and easier for others to find and understand. -
Terms: You can revise the terms of use for your dataset at any time. This includes setting or updating restrictions, licenses, and disclaimers to clarify how others can access and use your data. See detailed documentation here.

-
Permissions: Permissions allow you to control who can access or modify your dataset. For instance, you can grant collaborators editing privileges, restrict file access, or make the dataset publicly available. Read more about managing permissions in the Dataset-level section in the original documentation.
-
Private URL: If you need to share your dataset privately before publishing it, you can generate a private URL. This link lets others view or download the dataset even if it isn’t publicly accessible yet. Learn how to create and use private links in the Dataverse Documentation.
-
Thumbnails and Widgets: Make your dataset stand out with a thumbnail image or embed widgets to share it dynamically. A thumbnail adds a visual identifier for your dataset, while widgets allow for live previews or sharing functionality. For setup instructions, see the Dataverse Thumbnails + Widgets section.
-
Delete Dataset: If a dataset is no longer needed, you can delete it. However, keep in mind that deleting a dataset is permanent and can only be done by users with the right permissions. Before deleting, make sure it is no longer required for ongoing projects or collaborations.
Mandatory steps for BSC datasets submission¶
Once you have created a dataset, you must always edit it before submitting it for revision.
Add more metadata¶
Click the Edit dataset
> Metadata
button (Figure 3). When you create a dataset, only a limited number of metadata fields are initially available. After clicking this button, many additional fields will become accessible.
Complete the relevant metadata blocks. By default, you will see the citation fields (see here). Additional metadata blocks may also be available, depending on your Dataverse Stewards' configuration. These include:
- Geospatial (see fields here)
- Social Science and Humanities (see fields here)
- Astronomy and Astrophysics (see fields here)
- Life Science (see fields here)
- Journal (see fields here)
Add IPR metadata¶
Click the Edit dataset
> Terms
button (see Figure 3). When you create a dataset the license added by default is CC-BY, which means that others are free to share, copy, redistribute, and adapt the dataset, even for commercial purposes, as long as proper credit is given to the original creator.
Please, check the CC Licenses section for more information. You can modify the license if needed, although from BSC we do not recommend using CC0. You can find this field under “Dataset Terms”.
If required, you can create your own Terms and Conditions by selecting the corresponding option:

Add a guestbook¶
Click the Edit dataset
> Terms
button (see Figure 3). Under the section “Guestbook” you must select a guestbook.
Use the BSC Guestbook, which is already registered in the Dataverse collection you are working with. If necessary, your Dataverse Stewards can create a custom guestbook with specific fields tailored to your needs. If this is required, please contact your department’s Dataverse Steward.