Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 16 Current »

With CollectionSpace 2.0, the schema have changed, and therefore so will the data mapping and customizations we are employing in the UCJEPS deployment of CollectionSpace 1.0.  This is not a final data mapping for the University and Jepson Herbaria. This is just a next iteration.  Changes from the previous mapping are in blue.

This iteration will focus on object cataloging data, persons, organizations, taxonomy and SMaSCH vouchers (which are really text annotations), and the relationships needed to support those.  Mapping  information about Loans In is included.

  File Modified
You are not logged in. Any changes you make will be marked as anonymous. You may want to Log In if you already have an account.
No files shared here yet.
  • Drag and drop to upload or browse for files
  • Object cataloging data

    Field mapping

    SMaSCH

    CollectionSpace object schema

    CS User Interface Title (** = different from UI)

    Note

    accession.accession_id

    object_number

    Accession Number **

    Change field title

    institution.inst_name

    collection

    Collection

    Dropdown (Collection values)

    accession.coll_num_prefix + coll_number + coll_num_suffix

    other_number

    Number

    Moved to Field Collection Number.

    "collector number"

    other_number_type

    Number Type

    Not needed if we move this to Field Collection Number.

    accession.notes

    comments

    Comments


    objkind.kind

    form (Object Description Information)

    Form

    Dropdown (Form values)

    taxon_fullname.fullname

    title

    Taxon **

    Change field title

    accession.phenology (decoded)

    phase (Object Description Information)

    Phenology **

    Change field title. Dropdown (Phenology values)

    accession.early_jdate (calc)

    field_collection_date_earliest

    Field Collection Date Earliest

    Custom 

    accession.late_jdate (calc)

    field_collection_date_latest

    Field Collection Date Latest

    Custom

    accession.datestring

    field_collection_date

    Field Collection Date

    We are converting this to the structured date format

    committee.committee_abbr

    field_collection_collector

    Field Collection Collector


    accession.loc_lat_decimal

    field_loc_lat_decimal

    Field Location Latitude Decimal

     

    accession.loc_long_decimal

    field_loc_long_decimal

    Field Location Longitude Decimal


    accession.loc_place (calc)

    field_collection_place (?)

    Field Location Verbatim**

    Will map to verbatim field in schema extension for now.  When the Place Authority is available, we will work on creating authority records as well.

    accession.loc_county

    field_loc_county

    Field Location County

    Custom

    accession.loc_state

    field_loc_state

    Field Location State

    Custom

    accession.loc_country

    field_loc_country (temp custom)

    Field Location Country

    Custom

    accession.loc_elevation

    field_loc_elevation (temp custom)

    Field Location Country

    Custom

    accession.coll_num_prefix + coll_number + coll_num_suffix

    ?

    Field collection number


    committee.committee_abbr

    ?

    Field collection collector


    "catalog date"

    date_association

    Date Association

    Fixed text. Do not load into 1.0.  May end up in Associated Date information eventually, or create date/modify date.

    accession.catalog_date

    catalog_date

    Date Text

    Fixed text. Do not load into 1.0.  May end up in Associated Date information eventually, or create date/modify date.

    SQL query

    view: cspace_collobj_2_0

    Notes

    • loc_county, loc_state, and loc_country will not be supported by dropdowns for now
    • Collecting event information is being added as custom temporary fields.  CollectionSpace will have a separate authority system for collecting events and collectors, so these mappings will change in the future.

    Loans Out

    Field mapping

    SMaSCH

    CollectionSpace loans out schema

    CS User Interface Title (** = different from UI)

    Note

    loan_event.uc_loan_num + loan_event.jeps_loan_num

    Loan out number

     

    smasch has two types of loan identifiers.
    Loan Number relates accessions in a single collection that make up a loan.
    Loan ID relates loans from each collection that make up a loan.
    "Loan Number" = collection + loan_num
    "Loan ID" = loan_id

    loan_event.curr_inst

    Borrower

     

    Probably should keep this as borrowing institution (organization authority)

    loan_event:         
      curr_for_fname,
      curr_for_lname,
      curr_for_position

    Borrower's contact

     

    Probably should keep this as borrowing agent (person authority)

    loan_event.inhouse_notes + loan_event.noteworthy_inclusions

    Loan out note

     

     

    loan_event.loan_status +
    loan_detail.return_status

    Loaned object status

     

    It looks like this field is not currently active.  smasch keeps track of two status types:
    loan status ('Active' or 'Cancelled')
    return status ('All in', 'All out', 'Partial', 'Discrepancy', 'Unknown')

    Loan date?

     

    Loan Out Date

    Does SMaSCH have a loan date?

    SQL query

    view:

    Notes

    ...

    Object to Loan Out Relationships

    • On the way

    Data sets for dropdowns and controlled lists

    1. Collection values:
      1. Jepson Herbarium
      2. University Herbarium, UC Berkeley
      3. University of California
    2. Form values:
      1. Illustration
      2. Mounted on Paper
      3. Photocopy
      4. Photograph
      5. Stored in a Box or Bag
    3. Phase (Phenology) values:
      1. Cone
      2. Flowering
      3. Flowering/Fruiting
      4. Fruiting
      5. Spores/Sporangia
      6. Vegetative (non-reproductive)
    4. Loaned Object Status values:
      1. Active (Unknown)
      2. Active (All Out)
      3. Active (All In)
      4. Active (Partial)
      5. Active (Discrepancy)
      6. Cancelled (Unknown)
      7. Cancelled (All Out)
      8. Cancelled (All In)
      9. Cancelled (Partial)
      10. Cancelled (Discrepancy)
    5. Other Number Type values:
      1. Other herbaria accession number
      2. Copied from accession number
      3. Exsiccatae number
      4. Project number
      5. Genbank number
      6. Destructive sampling number
      7. Internal cross reference
      8. Miscellaneous

    Name Authority: Person

    Field mapping

    SMASCH

    cSpace Person Schema

    CS User Interface Title (** = different from UI)

    Note

    person.person_id
    borrower.borrower_id

    Legacy ID

     

    borrower is not part of smasch; newly created to generate pk id for borrowers in loan_event table.

    person.first_name +
         person.other_name +
         person.last_name
    loan_event.curr_for_fname +
         loan_event.curr_for_lname

    Display Name

     

     

    person.first_name
    loan_event.curr_for_fname

    Forename

     

     

    person.other_name

    Middle Name

     

     

    person.last_name
    loan_event.curr_for_lname

    Surname

     

     

    person.bio_notes

    Biographical Note

     

     

    person.birth_year

    Birth Date

     

     

    person.death_year

    Death Date

     

     

    loan_event.curr_for_position

    Occupation

     

     

    loan_event.curr_inst

    Group

     

     

    'collector' + '; name source: ' + data_source.data_src_name  
    'borrower'

    Name Note

     

     

    SQL query

    View: cspace_person_2_0

    ETL

    Talend job GET_PERSON_V1 (Hoffman):

    • Input: Query view
    • See  SMaSCH data mapping notes for some special data handling
    • Output: XML file for loading (batches of 3000)
    • Output: List of CSIDs (to facilitate deleting)
    • Output: CSV file of basic identifiers (for record keeping and as input to one of the Org loading jobs).
    • 8945 rows

    Notes

    TBD

    Name Authority: Organization

    Field mapping

    SMASCH

    cSpace Organization Schema

    CS User Interface Title (** = different from UI)

    Note

    committee.committee_id
    institute.inst_abbr

    Legacy ID

     

     

    committee.committee_abbr
    institute.inst_name

    Display Name

     

     

    committee.committee_name
    institute.inst_name

    Long Name

     

     

    committee.committee_abbr
    institute.inst_abbr

    Short Name

     

     

    institute.inst_state +
        institute.inst_country

    Foundation Place

     

     

    committee.committee_func
    'Institution'

    Function

     

     

    SQL query

    view: cspace_org_2_0

    ETL

    Talend job GET_ORG_V2 (Hoffman):

    • See SMaSCH data mapping notes for some special handling
    • Input: Query view
    • Input: ucjepspersonout-ids.txt from GET_PERSON_V1 (to get refnames of organization contacts from person authority)
    • Output: XML file for loading (batches of 4000)
    • Output: List of CSIDs (to facilitate deleting)
    • Output: CSV file of basic identifiers (for record keeping).
    • 49845 rows

    Talend job (Cheng):

    • Input: Text file from Dick Moe, non-colliding-collectors.in, containing names of seaweed collectors
    • 1546 rows

    Notes

    ...

    Organization Contact schema

    Field mapping

    SMASCH

    cSpace Contact Schema

     

    CS User Interface Title (** = different from UI)

     

    Note

    institute.email

    Email

     

     

    institute.fax

    Fax

     

     

    institute.url

    URL

     

     

    institute.address1 +
    institute.address2 +
    institute.city +
    institute.zipcode +
    institute.state +
    institute.country

    Place

     

     

    "street"

    Address Type

     

     

    institute.phone

    Phone

     

     

    • No labels