press the + ADD EXTRA COLUMNS button to open the dialog boxes

Use this tab to create the additional columns that the dataset will need for ingestion. This is particularly useful for columns that apply to an entire dataset, such as assigning an analysis method, or a country location. Or you can copy an entire column and give it a new name, which is useful for creating a join based on a specific column name.

boxes to fill intype of boxwhat to put here
New Column Namefree-formthe name of your new column
Expressionfree-formenter a formula or value that will fill the column with the appropriate information

It can also be used to help pull out data that only appears in some of the rows of the dataset.

For example, imagine a dataset where some of the sites have a lämningsnummer (The Swedish National Heritage Board’s national site identifier number).

  1. Create a data-derived entity from the source dataset called site_property (with a public ID of site_property_id)
  2. In the Columns box select the column lämningsnummer
  3. In the Extra Column Tab: create a new column called site_property_type_id and set its Expression to 1. This will add that column to the dataset, and for every row that there is information in the lämningsnummer column, then the new column will have a value of 1.

This can be done in conjunction with creating a fixed value entity called site_property_type which adds a column (in the column box on the Basic tab) called site_property_name, plus two manually created rows (+ ADD ROW button), with “National site identifier” in row 1. subsequent rows would be added for any other site property names relevant to the dataset, and entities to define them would be created.

There are onscreen help suggestions with examples of various options to use during column creation

these come in a format with the name of the command, then example syntax, then a text description of what happens, like this:

  • Copy: existing_column - Reuse a source column under a new name

in these cases, the command name is not entered into the cell. So in this example, you would enter only the name of an existing column, and the new column will then have all of the same information as that original column.