The columns in this dataset that relate to publications for each sample are:
- Författare = authors
- Tryckår = year
- Titel = title
- Tidskrift = (no SEAD equivalent, but can be added to full_reference)
- Förlagsort = (no SEAD equivalent, but can be added to full_reference)

To make full_reference, combine all of the above using the Extra Columns tab and the expression “{author} {publication_year} {titel} {tidskrift} {publication_city}” (or whatever the column names turn out to be).
- create a data-derived entity biblio (derived from the Entity datasheet), with biblio_id as its public ID
- in the Basic tab choose as Columns: forfattare, tryckar, forlagsort, tidskrift, titel
- add an Extra Column called full_reference using {forfattare} {tryckar} {titel} {tidskrift} {forlagsort}
- drop duplicates based on the full_reference
name: biblio
type: entity
system_id: system_id
keys: []
columns:
  - forfattare
  - tryckar
  - forlagsort
  - tidskrift
  - titel
public_id: biblio_id
source: datasheet
drop_duplicates:
  - full_reference
check_functional_dependency: false
extra_columns:
  full_reference: '{forfattare} {tryckar} {titel} {tidskrift} {forlagsort}'
- create the Entity site_references and do the joins
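
The biblio steps above (pick the columns, build full_reference, drop duplicates on it) can be sketched in plain Python to check what the result should look like. This is only an illustration, not how the tool itself works: the sample rows are invented, the function name build_biblio is hypothetical, and numbering biblio_id sequentially and collapsing extra spaces left by empty fields are both assumptions.

```python
# Column names and template are taken from the biblio config above.
COLUMNS = ["forfattare", "tryckar", "forlagsort", "tidskrift", "titel"]
TEMPLATE = "{forfattare} {tryckar} {titel} {tidskrift} {forlagsort}"

# Invented sample rows, for illustration only.
rows = [
    {"forfattare": "Andersson, A.", "tryckar": "1990", "titel": "Title one",
     "tidskrift": "Journal X", "forlagsort": "Lund"},
    # Exact duplicate of the first publication (another sample, same reference).
    {"forfattare": "Andersson, A.", "tryckar": "1990", "titel": "Title one",
     "tidskrift": "Journal X", "forlagsort": "Lund"},
    # A publication with no journal.
    {"forfattare": "Berg, B.", "tryckar": "2001", "titel": "Title two",
     "tidskrift": "", "forlagsort": "Uppsala"},
]


def build_biblio(rows):
    """Return one record per distinct full_reference.

    Assumptions (not confirmed by the source): biblio_id is a simple
    running number, and repeated spaces from empty fields are collapsed.
    """
    seen = set()
    biblio = []
    for row in rows:
        # Keep only the configured columns; treat missing values as empty.
        record = {col: row.get(col) or "" for col in COLUMNS}
        # Fill the template, then collapse runs of whitespace.
        full_reference = " ".join(TEMPLATE.format(**record).split())
        if full_reference in seen:
            continue  # drop duplicates based on full_reference
        seen.add(full_reference)
        record["full_reference"] = full_reference
        record["biblio_id"] = len(biblio) + 1
        biblio.append(record)
    return biblio


biblio = build_biblio(rows)
for record in biblio:
    print(record["biblio_id"], record["full_reference"])
```

Deduplicating on the combined string means two rows count as the same publication only if every component matches exactly, so spelling variants of an author or journal name will still produce separate biblio records.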
