Data loading environment and process

Note: These initial notes describe how the deployment team at UC Berkeley is loading data in CollectionSpace deployments.

Background and overview

More to come.

System requirements (very drafty)

  • Eclipse 3.4 or better, maven 2.2, JDK version 1.6
  • OS: We are working in linux but will be testing data loading in other operating systems.

Setting up the environment (very drafty)

get whole release: e.g.,  svn co https://source.collectionspace.org/collection-space/src/services/trunk .

get collectionspace.importer.pahma.jar for release (which Aron has made in the past)

update pom.xml to new release
In the properties section: change the value of <cspace.services.client.version> to match the current services release.
Optionally, you can also change the ArtifactId and name to something appropriate. e.g. "collectionspace.importer.myinstancename". It is not required, but probably is a good idea.

mvn install

in Eclipse: import, materialize

Update java code for this release

Note: The code used to load the data is based on utility methods and integration test code in the full release: e.g.,https://source.collectionspace.org/collection-space/src/services/trunk/services/collectionobject/client/src/test/java/org/collectionspace/services/client/test/CollectionObjectServiceTest.java
and samples that Richard and Aron created for early releases.

Note: We will attach some samples here.

Version control for data loading programs and data files.  More to come.

Creating data file

More to come.

Loading the data file

More to come.

Verifying and testing data

More to come.