-
Notifications
You must be signed in to change notification settings - Fork 1
Expand file tree
/
Copy pathChangeLog.txt
More file actions
220 lines (219 loc) · 31.8 KB
/
ChangeLog.txt
File metadata and controls
220 lines (219 loc) · 31.8 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
List of changes to the migrator template, in reverse chronological order of implementation:
2021-11-12 Added invocation of dynamicProperties update for ChronometricAges to the macro ChronometricAge DynamicProperties.
2021-10-23 Updated ChronometricAges for ratified version.
2021-02-04 Removed the 60 records from the LegacyDates table in templates/LegacyTemplate.mdb. These would have affected the first 60 records in any migrator if not removed first.
2020-11-19 Put the Dynamic Properties macro back in. Updated to review version of ChronometricAge extension.
2020-06-06 Major overhaul to ChronometricAges to allow a list of them in dynamicProperties.
2020-05-29 Added ChronometricDatesTemplate table and step to copy it to ChronometricDates table to insure that it is empty before processing.
2019-07-15 Updated to new ChronometricAge extension.
2018-11-03 Amended aggregation scripts to avoid lost records when cat-ing split files that did not have LF on the last row.
2018-10-08 Added query "Update dynamicProperties for StreamWidth".
2018-10-08 Amended LegacyElevation processing to interpret '0' and '0.0'. Amended query "Update dynamicProperties for Weight" to match field [Weight] as a default instead of the zooarchaeology pattern.
2018-10-07 Amended LegacyDates processing to avoid interpretation of '0000' in verbatimDate as a real year.
2018-09-19 Amended query "Update SimpleDwC - references" to substitute '-' for ':'.
2018-07-14 Separated out modified queries to each deal with one field only in an attempt to avoid resource exceeded messages on large data sets.
2018-06-30 Updates to implement the new version of the ChronometricDate extension.
2018-06-12 Amended macro "Reports" to invoke query "Report - elevations reversed" correctly, the Table Name had an extra space in it.
2018-06-03 Modified PurgeNonprintingCharacters.sh to NOT remove spaces before double quotes. Spaces are not non-printing characters and removing them this way makes it very hard to find the real problems of non-printing characters.
2018-06-02 Added processing of trailing single quote character (’) in queries "LegacyCoordinates processing 13" and "LegacyCoordinates processing 14".
2018-06-02 Added three missing reports to macro "Reports". Fixed query "ReportSummary - coordinateUncertainties out of range", which had Group By instead of Count for the field occurrenceID. Not very much a summary that way. Doh!
2018-05-27 Added processing for dates with the pattern yyyy-mm-.
2018-05-21 Changed the query "Update Location - higherGeography key" to not trim higherGeography. Doing so resulted in duplicate higherGeography values, which has the potential to have inconsistent resolution and creates a much more demanding join in the query "Update Location 2 - higherGeography". Made associated change in the query "Update higherGeography from verbatim values" in VocabularyManager.mdb.
2018-05-21 Amended Aggregator to correct invocation of query "Append Plants to SimpleDwCForIPT".
2018-05-17 Amended Prepend queries that were reference the table SimpleDwC-verbatim from Specimens as their source of values to use the table Specimens instead. There is no functional difference from the previous version since these are customized queries anyway.
2018-04-27 Added scientificNameAuthorship to scientificName construction in query "Update Taxon2 - scientificName" and took away condition scientificName is Null.
2018-04-07 Removed cast to Str in query "Update dynamicProperties for Weight" to avoid losing all previously processed dynamicProperties if the value of the mapped weight field is a string already.
2018-04-06 Corrected Legacy Depths processing to not lose DepthUnits in step 06.
2018-04-02 Added ChronometricDate support.
2018-02-12 Amended misspelling in macro query invocation "Append Plants to SimpeDwCForIPT".
2018-01-09 Added queries for zooarchaeology preparations. Added queries for zooachaeology dynamicProperties.
2018-01-08 Further modified scripts "1c - RunAggregators.bat" and PurgeNonprintingCharacters.sh to use avoid out of memory errors.
2017-12-25 Changed file encoding of PurgeNonprintingCharacters.sh from utf8 to ANSI so that it executes properly in sh shell.
2017-12-23 Added ResourceRelationship processing to AggregatorTemplate.
2017-12-22 Changed substitution character combination in PurgeNonprintingCharacters.sh from ññ to @&@ to avoid potential encoding issues in some shell execution environments.
2017-12-03 Added a host of dynamicProperties queries from LACM Fish collection.
2017-12-03 Added dynamicProperties queries for air temperature and groundtemperature. Added query for cleared and stained preparations.
2017-12-03 Corrected query "Update SimpleDwC - references" to always use the occurrenceID.
2017-12-02 Added bibliographicCitation builder.
2017-12-02 Corrected encoding and substitution characters in PurgeNonprintingCharacters.sh. Not sure where the encoding change occurred, as it does not appear in the commit history.
2017-12-01 Changed the ReportSummary output column to "records affected" from "count".
2017-11-30 Added depth reversal and elevation reversal.
2017-11-30 Added reports for depths reversed and elevations reversed.
2017-11-30 Added ReportSummary reporting. Gives list of reports that have at least one record in the specific report.
2017-11-07 Enabled the template migrator to run as delivered, without customization.
2017-11-07 Added renaming of Report for uninterpretable verbatimEventDate.
2017-11-02 Further amendment of non-printing character purging script to accommodate Text and avoid memory errors with sed for long lines.
2017-11-02 Amended query "Update Taxon 2 - subspecies to null for null species". Was the same as query "Update Taxon 2 - species subspecies to null for null genus".
2017-11-02 Removed two unneeded versions of the Do DwC2 migration macro.
2017-11-02 Further amendment of non-printing character purging script to accommodate Text.
2017-11-02 Amended latest AggregatorTemplate to make processingID not a primary key, lest records be lost when aggregating two or more sources. This was a recently introduced error.
2017-11-01 Added 'Text' as a type value to detect non-printing character issues.
2017-11-01 Added 'Text' as a type value to detect at the beginning of a line to discover new line issues.
2017-11-01 Changed SourceTemplate.mdb to SourceTemplate-Verts.mdb along with references to it in DwC2migration.bat to be able to have multiple sources when aggregating original data sets.
2017-11-01 Added query "Update dynamicProperties for Original Source". Amended query "Update dynamicProperties for OvumLength".
2017-10-26 Added "Report - verbatimEventDate not interpretable".
2017-10-26 Added missing fields samplingSizeValue and samplingSizeUnit to SimpleDwCForIPTquery in AggregatorTemplate.mdb.
2017-10-21 Corrected Report queries for non-standard genus, family, order.
2017-10-20 Removed linked Occurrence table in SourceTemplate.mdb as well as the macro to load the Specimens table in that database. They were confusing and unnecessary.
2017-10-20 Added occurrenceID to relevant reports.
2017-09-20 Renamed placeholder file ForIPT\SimpleDwCForVertNet.csv to ForIPT\SimpleDwCForIPT.csv. No functional change to the migrator.
2017-09-20 Added necessary copies of database templates into workspace.
2017-09-18 Amended query "Update Taxon2 - nomenclaturalCode" to consult a new vocabulary table called "phyla", which matches phylum to kingdom and nomenclaturalCode.
2017-09-18 Added copies of Source.mdb, Legacy.mdb, and VerbatimDwC.mdb to workspace folder to facilitate migrator customization, as there are references to these database in the workspace in the DwC2ExtractTemplate.
2017-09-18 Major overhaul to separate working tables into distinct databases to save space and allow for undivided processing of much larger data sets. Should allow for datasets of at least 500k records. Also simplified maintenance by making a single DwC2 migration bat files that is parametrized in calls from the script 1a - Run migrators.bat. Removed export specifications from Report query exports resulting in exports of .csv files rather than .txt files.
2017-09-18 Updated reports to consistently use institutionCode along with collectionCode and catalogNumber. Cleaned up all reports to be consistent with new Report Explanation documents.
2017-09-17 Made report queries and output file names consistent with the casing of Darwin Core terms (e.g., "Report - non-standard islandGroup" instead of "Report - non-standard IslandGroup". This was to make the report explanation documentation consistent and less confusing. No functional change to the migrator.
2017-09-13 Amended query "Update from LegacyScientificNames" to capture the identificationQualifier.
2017-09-13 Simplified macro "Do DwC2 migration" by removing sections of it to other macros, which are now invoked with RunMacro within "Do DwC2 migration".
2017-09-13 Updates migrator macro "Do DwC2 migration" to enable all Legacy queries by default. It is easier to customize the queries called "LegacyXXX from Specimens" to propagate no changes (with processingID <0) than it is to enable and disable queries in the new macro editor.
2017-09-13 Amended AggregatorTemplate.mdb to rename queries and tables with "ForVertNet" to "ForIPT" instead. Also amended Append queries and SimpleDwCForIPT structure to contain the new fields added in version 2017-08-17.
2017-09-11 Changed links to vocabulary lookup tables in DwC2ExtractTemplate-Verts.mdb to point to VocabulariesMaster.mdb rather than VertNetVocabularies.mdb.
2017-09-11 Removed incorrect mapping of verbatimLongitude to verbatimCoordinateSystem in query "SimpleDwC - verbatim from Specimens". Not sure when that crept in.
2017-09-11 Added paths for default locations of Access.
2017-09-10 Corrected query "Lookup - organismScope".
2017-08-26 Added vocabulary management for coordinatePrecision, identificationVerificationStatus, license, nomenclaturalStatus, organismScope, day, and taxonomicStatus.
2017-08-18 Corrected references in all queries "SimpleDwC without matching LookupX" to change the reference from ID to occurrenceID, since ID was removed from the SimpleDwC structures. Previous version threw parameter errors when running Do DwC2 migration macro.
2017-08-17 Changed the structure of the default linked file to match the complete Simple Darwin Core to facilitate making Event- and Taxon-based migrators. Changed the structure of SimpleDwC-verbatim and SimpleDwC to match. Added hitherto missing Event and Occurrence fields to the query "SimpleDwC2 - verbatim from Specimens". Removed the query "SimpleDwC2 - verbatim from Specimens - common". We did not use it in practice.
2017-08-17 Added extensive commentary and indicators of required and to-be-customized queries in the macro "Do DwC2 Migration".
2017-08-17 Amended queries "Append new month to LookupMonth from month" and "Append new month to LookupMonth from endmonth" to correctly add new vocabulary variants from those fields.
2017-08-16 Amended query "Update Taxon 2 - nomenclaturalCode" to apply changes only if nomenclaturalCode is null.
2017-08-14 Amended misspelling in query "Update dynamicProperties for SnoutVentLength". Confirmed that the misspelling did not affect any records published in VertNet.
2017-06-12 Added verbatimCoordinates to the default mapping list for the query "Legacy Coordinates from Specimens".
2017-06-12 Amended the expressions to be explicit on quoting for all dynamicProperties updater queries so that the quoting does not cause them to fail.
2017-06-04 Improved the interpretation of multiple associatedMedia into the Audubon Core extension.
2017-03-27 Changed names of queries "Update occurrenceID in SimpleDwC" and "Update references in SimpleDwC" to "Update SimpleDwC - occurrenceID" and "Update SimpleDwC - references" just for consistency.
2017-03-13 Added capacity to interpret date information from verbatimEventDates of the form DD-mmm-YY when year is also given.
2017-01-26 Amended 'Update to' expression for query "Update dynamicProperties for Weight" to avoid syntax errors.
2017-01-16 Added capacity to interpret date information from verbatimEventDates of the form DDmmmYYYY.
2016-10-19 Cleaned up query "SimpleDwC-verbatim from Specimens" to map to standard Darwin Core field names.
2016-10-01 Added processing steps for Dates (format yyyy-mm/yyyy-mm) and ScientificNames containing 'cf'. Added organismQuantity and organismQuantityType to the SimpleDwC structures throughout.
2016-09-27 Fixes to major release, which included some mappings based on a non-generic linked occurrence table.
2016-09-23 Added Plants processing to the migrator suite.
2016-09-23 Major overhaul to vocabularies and associated queries. Updates to migrators older than today must use this or newer migrator template. The old ones are incompatible with the newer vocabularies. This change is a move closer to replacing the Access-based migrators with Kurator work flows.
2016-06-02 Modified macro "Do DwC2 Migration" to remove records with duplicate occurrenceIDs from table SimpleDwC rather than from table SimpleDwC-verbatim.
2016-05-25 Added support for a Plants migrator bat file and made missing Fungi migrator bat file.
2016-05-22 Amended DwC2migration-X.bat scripts to name the IndividualCount out of range report correctly based on the script being run.
2016-04-01 Amended minor details in the macro "Do DwC2 Migration". Not functionally different from the 2016-03-31 version.
2016-03-31 Added query "Report - individualCount out of range" and support to check that the original field is numeric and not less than zero. Added to Reports generated as well.
2016-03-14 To support providing a lookup between a deprecated occurrenceID and its replacement occurrenceID in a ResourceRelationship Extension, added tables ResourceRelationshipExtension and ResourceRelationshipExtensionTemplate. Added query "Append occurrenceIDs to ResourceRelationshipExtension". Added processing steps to the macro "Do DwC2 Migration", but turned off by default.
2016-03-14 To support processing multiple media in the associatedMedia field into distinct Audubon Core records. Added tables "Media" and "MediaTemplate". Added query "qryMedia". Added Module "MediaModule". Added processing steps to the macro "Do DwC2 Migration", but turned off by default.
2016-03-14 Changed all reports except "Report - missing catalogNumber", "Report - StatsDifferences", and "Report - StatsCompleteness" to use export specifications and to output .txt files instead of csv files.
2016-03-14 Added an InvertsDwC2.mdb database to the workspace folder.
2016-02-03 Major overhaul to date processing to permit eventDate as year only and to correctly process the dd MMM pattern.
2015-11-25 Further amendments for date processing.
2015-11-24 Added processing for Entomological date styles (e.g., 24.xi.2015). Amended date processing that created badly formatted ISO8601 date ranges with a leading / (e.g., /2015-11-24).
2015-10-23 Amended Legacy Coordinate processing queries to interpret new pattern found in MNHN Mammals (e.g., 09°10,391'_N 008° 56,062'_W and 49°21 Sud 67°10 Est).
2015-10-21 Added queries for additional dynamicProperties based on fields in UMMZ Birds. Rearranged the order of dynamicProperty additions to alphabetical.
2015-10-15 Added support for the Audubon Core Extension.
2015-10-13 Added queries for additional dynamicProperties based on fields in CRCM Birds and Mammals.
2015-10-13 Added query in Legacy Coordinate processing to translate European decimal indicator "," to ".".
2015-10-09 Added script globalnamesresolver.json to be used in OpenRefine to call on the Global Names Resolver API. This is for convenience. It does not affect the migrator.
2015-09-20 Added criterion null to query "Update Taxon 2 - scientificName" so that original scientificNames that ought to be kept (e.g, those containing authorship) are not overwritten.
2015-09-05 Further changed Coordinate queries to accommodate idiosyncrasies found in CM Herp data. Added query "Update Identification - dateIdentified."
2015-09-04 Changed Legacy Coordinate queries to prioritize verbatim coordinates from processing over degrees minutes seconds based on TCWC data set. Otherwise some coordinates in verbatim latitude and longitude were lost because there where zero values in the degrees minutes seconds fields.
2015-09-04 Added queries "Update Occurrence - preparations for blood sample" and "Update Occurrence - preparations for ectoparasites".
2015-08-06 Added processing for duplicate occurrenceIDs.
2015-06-12 Added common path to Access 2015 executable in the .bat scripts.
2015-04-23 Amended query "Update modified to yyyy-mm-dd" so that it only affects values where the criterion IsDate(modified)=True is satisfied.
2015-04-11 Added scripts, databases, and macros for Audio data sets.
2015-04-09 Amended macro "Do DwC2 Migration" to turn on append lookup months inside the LegacyDates processing series.
2015-04-08 Modified the query "LegacyDates processing 02a" to trim white space from the calculated LatestDateCollected.
2015-04-08 Added queries "LegacyDates processing 02c" and "LegacyDates processing 02d" to accommodate verbatimEventDates of the form XX XXX XXXX.
2015-04-06 Replace generic queries "Update Occurrence - preparations for Part X" with queries for specific part names. Doesn't change the functionality of any existing migrators, just makes it easier to create new ones.
2015-04-04 Added queries for various dynamicProperties. Made others that are for lengths consistent in checking for the condition <>"0".
2015-04-01 Added query "LegacyDates processing 02b" to process verbatimDates of the form YYYYMMDD. Updated the macro "LegacyDates Test" to include this query and added two examples to the table "LegacyDatesTest".
2015-04-01 Changed query "LegacyDates processing 02" to let through verbatimDates that are numeric and only 4 digits rather than any numeric.
2015-03-19 Made corrections to the file "README - MigratorPrepSteps.txt".
2015-03-05 Corrected query "Append new Geography to LookupGeography", which wrote the municipality data into the municipality field in the LookupGeography table rather than writing it into the verbatimMunicipality field. Relevant error to any collection with municipality in the original data.
2015-02-25 Added check for "n/a" in queries "Update Taxon - specificEpithet for unknown" and "Update Taxon - infraspecificEpithet for unknown".
2015-02-25 Changed separator in the "withhold" queries from "," to " | ".
2015-02-18 Added and amended steps in Legacy Date processing to account for verbatimEventDates of that have IsDate() true but have no day (e.g., Jun-48) and for decades (e.g., 1980s). Upgrading from the version 2015-01-27 should replace the entire LegacyDates processing section.
2015-01-27 Amended joins in queries "Append to StatsCompleteness" and "diff-..." to use processingID instead of ID. This caused quality statistics not to be captured from 31 Oct 2014 to 27 Jan 2015.
2015-01-05 Amended query "Update dynamicProperties to close" so that it always closes.
2014-12-24 Amended queries "Step [nn]" to better reflect the activity. Renumbered steps 24 and later. Set them to run in the macro "Do DwC2 Migration".
2014-12-24 Added query "Update Location 2 - countryCode Country Continent" to grab potentially standardized geography from vocabularies even if not finished when the current countryCode is null. This will make much more of the geography standard even if it isn't fully reconciled in the LookupGeography table.
2014-12-23 Reduced all reports to one series in the macro "Do DwC2 Migration". Added renaming reports by collection type to the scripts "DwC2migration-[type].bat".
2014-12-23 Updated the query "Update references in SimpleDwC" to match the slugify function in the VertNet indexer under most circumstances. Does not account for accents as slugify does. Should not be a problem for migrators.
2014-12-19 Increased the number of split files to 10 instead of 6 to accommodate files up to 1M records when running script "1c - RunAggregators.bat".
2014-12-19 Corrected query "Update from LegacyElevations" so that it populates min and max elevationInMeters rather than min and max depthInMeters.
2014-12-19 Added template queries "Update Occurrence - preparations for Part X" that facilitate the construction of the preparations field if the original has preparations in distinct fields.
2014-12-18 Added query "Update references in SimpleDwC" to construct a reference to the record as it will appear in the VertNet portal if the record does not already have a references value.
2014-12-16 Corrected queries "Update Occurrence - non-printing occurrenceRemarks", "Update Organism - non-printing organismRemarks", "Update Event - non-printing EventRemarks", "Update Location - non-printing LocationRemarks".
2014-12-15 Corrected query "Report - day out of range" so that it does not report day out of range for dates on 31 August.
2014-12-15 Corrected the queries "Legacy coordinatePrecision numeric", "LegacyCoordinateUncertainties numeric", "processingID institutionCode collectionCode coordinatePrecision error", "Report - coordinateUncertainties not numeric".
2014-12-15 Added query "Update Location - non-printing LocationRemarks".
2014-12-15 Added query "Update Occurrence - occurrenceStatus" and removed expression to build occurrenceStatus from queries "SimpleDwC-verbatim from Specimens" and "SimpleDwC-verbatim from Specimens - common".
2014-12-15 Added query "Update Occurrence - establishmentMeans" and removed expression to build establishmentMeans from queries "SimpleDwC-verbatim from Specimens" and "SimpleDwC-verbatim from Specimens - common".
2014-12-15 Added query "Update occurrenceID in SimpleDwC" to set occurrenceIDs if they do not already exist in the source.
2014-12-13 Corrected queries "LegacyDates processing 10", "LegacyDates processing 11" and "LegacyDates processing 15", which set beginday, endday, and day to null for dates on 31 August.
2014-11-28 SimpleDwC-verbatim is now supposed to be filled with only those fields that can be mapped directly from the original. All other fields go through the Legacy table processing using queries such as "LegacyCoordinates from Specimens", "LegacyCoordinateUncertainties from Specimens", etc. Queries that are likely to require customization are marked in the macro comment with an asterisk (*) - for example, "*SimpleDwC-verbatim from Specimens"
2014-11-18 Reordered processing, added reports, added diffs for new DwC terms.
2014-11-18 Added query "Source" to filter records from "Occurrence" when populating table "Specimens". This query is what gets used by the query "Append Source Data to Specimens".
2014-11-18 Changed linked table "Source" to "Occurrence.
2014-11-17 Changed "report - missing catalogNumber" to be based on catalogNumber from Specimens table.
2014-11-17 Changed structure and processing of LegacyData table to several distinct legacy tables for distinct aspects of the data (coordinates, coordinate uncertainties, dates, depths, elevations, scientific names.
2014-11-17 Corrected query "Update modified from previous migration" to change records where modified is null in origin to be the value of modified in the previous migration.
2014-11-17 Changed query "Append new Geography to LookupGeography" to add higherGeography string based on verbatim values of Darwin Core fields in preferred order and separator (continent | country | stateProvince | county | municipality | waterBody | islandGroup | island).
2014-11-17 Changed query "Update higherClassification with verbatim taxonomy" to use the preferred separator " | ".
2014-11-17 Changed query "Prepend verbatim geography to verbatimLocality" to map from Specimens table using preferred order and separator (continent | country | stateProvince | county | municipality | waterBody | islandGroup | island). The query "Update higherGeography" still concatenates in the order and separator (continent;country;stateProvince;county;island;islandGroup;waterBody;municipality) used in the key field in the linked table LookupGeography. Don't change this.
2014-11-17 Changed SpecimensTemplate and Specimens to have a primary key called processingID meant to be used only within the migrator to link records to the original Source imported to Specimens.
2014-11-17 Changed Source model to link to an occurrence.txt file of a Darwin Core archive output from Simple Darwin Core version 2013-10-22 (previous version).
2014-11-17 Changed structure of SimpleDwC, SimpleDwC-verbatim, and SimpleDwCTemplate to include new fields and change rights for license.
2014-11-17 Major overhaul to accommodate Darwin Core version 2014-11-08. Re-ordered all processing. See above for other changes. Upgrades from old versions should be redone from scratch as this is migrator version is too different to just amend an old one.
2014-11-06 Added the missing queries 'diff-decimalLatitudeCount" and "diff-decimalLatitudeCount" and enabled them in the macro "Do DwC2 Migration".
2014-10-10 Moved query "Update georeferences for null decLatLong" to follow queries "Update geodeticDatum to not recorded" and "Update georeferenceVerificationStatus to unverified" in macro "Do DwC2 Migration".
2014-10-03 Added query "Update coordinateUncertaintyInMeters for precision".
2014-10-03 Added Report exports for Eggs and/or Nests to the macro "Do DwC2 Migration".
2014-09-23 Added query "Update DecLatLong for precision".
2014-09-20 Added queries "0 verbatimDepth with Unit", "1 Parsed minmaxDepth", "2 Parsed minmaxDepth", "Update LegacyData with parsed minmaxDepth" and improved the similar query for verbatimElevation processing - "0 verbatimElevation with Unit". Added verbatimDepth field to LegacyData table to support this.
2014-09-19 Added queries "Update out of range year to null", "Update out of range month to null", "Update out of range day to null".
2014-09-19 Added check for leap year not in centuries except those evenly divisible by 400 in query "Update DayOfYear".
2014-09-16 Added processing to replace more symbols in queries "01 - Legacy latitude replace characters" and "01 - Legacy longitude replace characters".
2014-09-16 Corrected processing of signs in queries "02 - Legacy longitude decimal degrees sans hemisphere" and "13 - LegacyData decimal latlong".
2014-09-16 Corrected error that referenced verbatimLatitude instead of verbatimLongitude in query "01 - Legacy longitude replace characters".
2014-09-16 Copied the structure of LegacyData from TTU Mammals migrator so that the step-wise processing implemented on 2014-06-11 can function correctly with the template and migrators derived from it.
2014-09-11 Added query "Update eventDate for yyyy-mm-dd" to correct eventDates with single digit month and day. This happens systematically with collections exported from Specify.
2014-09-01 Updated queries "Update infraspecificEpithet for SSP" and "Update specificEpithet for SP".
2014-07-24 Added query "Report - missing catalogNumber".
2014-07-16 Added query "Update higherGeography for all Nulls".
2014-06-11 Added step-wise processing for verbatim coordinates based on diversity in TTU Mammals data. Should be more readily reusable in other contexts than what the migrator had before. Removed previous LegacyData verbatim coordinate processing (
"Update LegacyData with LatLong from verbatimCoordinates",
"Update LegacyData for missing Dirs",
"Update LegacyData with DecimalDegrees",
"Update LegacyData with DMS",
"Update LegacyData with MaxErrorDistance",
"Update LegacyData with parsed decLatLong").
2014-06-11 Changed query "Update CoordinateUncertaintyInMeters from LegacyData" to convert coordinateUncertaintyInMeters to a long integer.
2014-05-21 Added reports for Year, Month, and Day out of range.
2014-05-21 Changed typo in field verbatimCoordinnates to verbatimCoordinates in LegacyData tables.
2014-04-14 In the query "SimpleDwC-verbatim from Specimens", added a check for preparator=collector, and used only one copy in recordedBy if so.
2014-02-03 Added query to turn coordinateUncertaintyInMeters to null if "0".
2013-12-21 Made amendments to various queries so that they are more generic and less likely to require adjustment when used as a template for new collections. Includes "Append Source Data to Specimens", "Prepend verbatim geography to verbatimLocality", "SimpleDwC-verbatim from Specimens", and "Update higherClassification with verbatim taxonomy".
2013-12-20 Added queries "Update species subspecies to null for null genus" and "Update subspecies to null for null species" so that records without higher ranks do not have lower rank epithets.
2013-12-10 Amended and separated checks for non-numeric decimalLatitude and decimalLongitude in queries "Update DecLatLong for non-numeric1" AND "Update DecLatLong for non-numeric2".
2013-12-10 Added reports for fossils to the "Do DwC2 Migration" macro.
2013-09-24 Added reports for StatsDifferences and StatsCompleteness and included these in the generated reports.
2013-09-02 Amended query name for "Update NomenclaturalStatus" to "Update NomenclaturalCode", which is what it actually does.
2013-08-27 Moved the encumbering part of the "Do DwC2 Migration" macro to just before copying SimpleDwC-verbatim into SimpleDwC. Otherwise the encumbrances would not occur.
2013-07-28 Separated the DiGIR processing into its own template and script to avoid running it if not necessary.
2013-07-28 Changed PurgeNonprintingCharacters.sh to replace last carriage return/line feed combination in the file with ññ so that split files will concatenate without joining to the first line of the next file.
2013-07-17 Added GitHub repository at https://github.com/VertNet/toolkit to manage code.
2013-07-12 Changed PurgeNonprintingCharacters.sh to use ññ instead of || as search and replace pattern because content with || was found in provider data (Macaulay Library).
2013-07-12 Added notHigherGeography field to LookupGeography in VertNetVocabularies.mdb, to capture information in higher geography that should be moved to locality. Added report to show records where changes can be suggested.
2013-07-12 Added error field to the indeterminate geography report and began managing that field in VertNetVocabularies.mdb.
2013-07-03 Amended Aggregator database for Append Queries (Fish, Fossil, Herp, and Mammal) that mapped verbatimCoordinates to verbatimCoordinateSystem.
2013-07-03 Added GeorefVerificationStatus vocabulary management.
2013-06-10 Added Purge processing for Dublin Core types Sound, StillImage, and MovingImage to PurgeNonprintingCharacters.sh and NewLineIssues.awk.
2013-03-04 Corrected faulty logic in "Update DecLatLong for range" and "Update DecLatLong for non-numeric" queries.
2013-05-03 Change strategy for verbatim classification. Store this in higherClassification with all ranks given in the source from genus up. Do not write over it with standardized values.
2013-05-02 Added a duplicate catalogNumber report to the outputs.
2013-05-01 Slightly rearranged and demarcated processing macro to distinguish production of SimpleDwC-verbatim, LegacyData, and SimpleDwC. Better demonstrates where data cleaning could be done outside the migrator.
2013-05-01 Added some verbatimLatitude and Longitude processing queries used in CM migrator to produce decimalLatitude and Longitude.
2013-04-18 Non-printing character report (in RunAggregators script) added to outputs in reports folder.
2013-04-17 Added more paths in scripts to find MSACCESS.EXE so that scripts will run on a larger number of default configurations.
2013-04-17 Added the capability to split the file ForIPT\SimpleDwCForVertNetPrePurge.csv into smaller files for purge processing to avoid out of memory errors.
2013-04-15 Corrected the Prepend Geography to verbatimLocality query to be an update query.
2013-04-15 Added LookupReproductiveCondition table to local vocabularies processing and MySQL, including statistics. Re-linked to StatsCompleteness table.
2013-03-15 Created reports folder and changed report file export destinations to it.
2013-03-15 Amended reports to compare verbatim original genus, family, order to standard versions - was comparing published versus standard.
2013-03-15 Start adding non-DwC standard taxonomy rank information to dynamicProperties.