Information about http://www.oceansites.org/docs/OTS2008/lankhorst_OSmeeting_dataformat.pdf

Data Management Issues: Summary of Recent Discussion on Data…

Tags: carval, e mail, file names, format description, instru ments, lankhorst, management issues, mismatch, mooring data, mooring line, mooring name, platform code, quality flag, situ data, subsurface mooring, temperature sensor, time level, xxx site, yyy, zzz,
Pages: 22
Language: english
Created: Fri Apr 11 00:25:29 2008
Display cached document
Page 1
image
Page 2
image
Page 3
image
Page 4
image
Page 5
image
Page 6
image
Page 7
image
Page 8
image
Page 9
image
Page 10
image
Page 11
image
Page 12
image
Page 13
image
Page 14
image
Page 15
image
Page 16
image
Page 17
image
Page 18
image
Page 19
image
Page 20
image
Page 21
image
Page 22
image
         Data Management Issues:
Summary of Recent Discussion on Data Format

                     Matthias Lankhorst

                        10. April 2008


         (Comments arising during presentation added in red)
Existing OceanSITES User Manual v1.0

Past months & here:
Revised Manual with Complete Format Description (E-mail from Thierry
Carval this March)--READ THIS TONIGHT!

Format is designed to accommodate:
- Subsurface mooring data (e. g. ADCP, T, S, Fluoro, Nuts)
- Surface buoy data (e. g. meteorology)
- Shipborne CTD, glider
- (Hopefully) data on sections and interpolated onto XYZT grids
NetCDF File Format:
File contains "attributes" and "variables":
nc = netcdf('OS_MOVE-V404-1_200001_TS.nc', 'noclobber');

nc.title = ncchar(''OceanSITES in-situ data, MOVE mooring V404'');
nc.conventions = ncchar(''OceanSITES User Manual Version 1.0'');
nc.netcdf_version = ncchar(''Unknown'');
nc.creation_date = ncchar(''2008-01-30 05:00:00 UT'');
...total of circa 30-40 attributes

nc{'TEMP'} = ncdouble('TIME', 'LEVEL'); %% 69173 elements.
nc{'TEMP'}.FillValue_ = ...
nc{'TEMP'}.long_name = ...
nc{'TEMP'}.standard_name = ...
...total of circa 10 attributes

nc{'TEMP_QC'} = ncbyte('TIME', 'LEVEL'); %% 69173 elements.
nc{'TEMP_QC'}.FillValue_ = ncbyte(0);
nc{'TEMP_QC'}.long_name = ncchar(''Quality Flag for TEMP'');
nc{'TEMP_QC'}.conventions = ncchar(''OceanSITES reference table 2'');
Reminder on File Names:

OS XXX YYY ZZZ.nc
or: OS XXX YYY ZZZ PARTx.nc

XXX: Site name actually called "platform code"
YYY: Configuration
ZZZ: Variables
PARTx: optional if files larger than 100 MB

Only three (or four) " "

Bad: OS V404 1 200001 CTD.nc

Good: OS MOVE-V404-1 200001 TS.nc
actually not quite good because mooring name not listed in official catalogue
What should be done if depth (DEPH) parameters of different instru-
ments in the same mooring mismatch?

Example: an ADCP at the surface delivers velocity at fixed, well-
known depths, while a temperature sensor on the mooring line moves
up and down as the mooring is tilted in the current.

Solution: Put data in separate files.

Done in recent manual update.
This was discussed controversely.
Define "parameters awaiting definition" as of manual v1.0 section 4.3!
Check consistency with other naming conventions (Nan Galbraith).

More parameters to define:


 · Sea water velocity in XY-coordinates (eastward and northward,
   resp.)


 · Air temperature (or is it ATMP ? remove ambiguity with CAPH!)


Has been worked on (Thierry Carval, Nan Galbraith). Are we done
yet? Cf. discussion led by Thierry later.
Manual section 4.3 references www.oceansites.org/data/units, but
this page does not exist. Suggest to create this and have it include
the standard parameter names and units.

Thierry: Status?
Global / General Attributes

Suggestion: Decide to NOT have separate metadata files. Include
all info in global attributes of data file. Remove term "general at-
tributes".

Consequence: attributes must be machine-readable and thus strictly
standardized.

Advantage: Fewer files, saves time for data providers. Information is
already there anyways.

Not agreed upon. There will be more metadata, cf. discussion led by Nan Galbraith later.
More attributes...

I find it confusing that some are CAPITALIZED and others not - do
we want to change this?

Suggestion: all attributes non-caps, all variables and dimensions CAPS.
Conflicts between global/general attributes:

creation date vs. DATE CREATION
distrubution statement vs. DATA RESTRICTIONS
keywords vocabulary exists, but not keywords
Shouldn't we have REFERENCE DATE TIME?
'institution' listed twice



Have been resolved in recent manual update. M. Lankhorst and T.
Carval: double-check.
Explicitly state that all global/general attributes are of type "char",
even those that contain numbers?
We want one human-readable time format in the attributes (previously
two).

Done in recent manual update. M. Lankhorst and T. Carval: double-
check.
Do we want * QC for TIME, LATITUDE, LONGITUDE, DEPH? I
think yes.

Has been included in recent manual update.
Update to manual: label and number "reference tables" more clearly
(e.g. reference tables 1 and 3 each deserve a seperate caption).

Done. M. Lankhorst will double-check.
Some of the data files are rather big (>100 MB). Should we have a
strict limit or at least a recommendation for file size (e.g.