OpenSeaMap-dev:IHO Hydographic Dictionary S-32: Unterschied zwischen den Versionen
Markus (Diskussion | Beiträge) (→clear structure) |
Markus (Diskussion | Beiträge) |
||
Zeile 1: | Zeile 1: | ||
Data base description... | Data base description... | ||
+ | |||
+ | = HowTo = | ||
+ | |||
+ | For editing the database: | ||
+ | # Start Excel (IHO_Hydrographic_Dictionary_S-32_Terms.xls) | ||
+ | # In table "Start": click "Edit database" | ||
+ | |||
+ | You see: | ||
+ | : an '''Excel sheet''' with all terms in English, Français and Español. | ||
+ | : an '''Userform''' with the "English master" (left), and editable (right) | ||
+ | |||
+ | In the Excel sheet you can: | ||
+ | : Compare al terms in English, Français and Español | ||
+ | : Edit a dataset (by doubleclick into a term) | ||
+ | |||
+ | In the User-form you can: | ||
+ | : Set the language displayed in the right side for editing | ||
+ | : Search in "Term" | ||
+ | : Scroll between datasets (by arrow-button) | ||
+ | : Edit each data field | ||
+ | : Check each data field and confirm this "Full check" | ||
+ | |||
+ | = ToDo = | ||
+ | {| class="wikitable sortable" | ||
+ | ! OK || Task || Who || Until || Done || Remarks | ||
+ | |- | ||
+ | | {{ok}} || Investigate the source documents || OpenSeaMap || || 2015-12- || | ||
+ | |- | ||
+ | | {{ok}} || Transform the source into a database || OpenSeaMap || 2016-01-10 || 2015-01-20 || | ||
+ | |- | ||
+ | | {{ok}} || Develop a tool for editing datasets || OpenSeaMap || || || | ||
+ | |- | ||
+ | | || Decisions about rules || IHB || || || | ||
+ | |- | ||
+ | | || Implement new rules || OpenSeamap || || || | ||
+ | |- | ||
+ | | || Edit the "English master" || IHB || || || | ||
+ | |- | ||
+ | | .. || || || || || | ||
+ | |} | ||
+ | |||
+ | = Improvements already done = | ||
+ | |||
+ | # Unique ID for each Term (in addition to the old number). | ||
+ | # Split Term and Description. | ||
+ | # Extract semantic data from text (Type of word, Sex of word, Plural, en-UK/en-US) <br>(~90% of all, some errors included) | ||
+ | # Add meta data about the datasets (Safe datum, LastCheck datum, Deleted) | ||
+ | |||
= Rules = | = Rules = |
Version vom 29. Januar 2016, 10:14 Uhr
Data base description...
Inhaltsverzeichnis
HowTo
For editing the database:
- Start Excel (IHO_Hydrographic_Dictionary_S-32_Terms.xls)
- In table "Start": click "Edit database"
You see:
- an Excel sheet with all terms in English, Français and Español.
- an Userform with the "English master" (left), and editable (right)
In the Excel sheet you can:
- Compare al terms in English, Français and Español
- Edit a dataset (by doubleclick into a term)
In the User-form you can:
- Set the language displayed in the right side for editing
- Search in "Term"
- Scroll between datasets (by arrow-button)
- Edit each data field
- Check each data field and confirm this "Full check"
ToDo
Improvements already done
- Unique ID for each Term (in addition to the old number).
- Split Term and Description.
- Extract semantic data from text (Type of word, Sex of word, Plural, en-UK/en-US)
(~90% of all, some errors included) - Add meta data about the datasets (Safe datum, LastCheck datum, Deleted)
Rules
clear structure
Use only:
- Term
- Description
For further attributes:
- use a specific column for each attribute.
All texts in UTF-8.
Attribute | Type | Length | Content | Description |
---|---|---|---|---|
ID | incremential | Unique ID for all languages | ||
number | text | 6 | #### a | Old number for old terms (may be obsolete in future) |
term | text | 255 | Term with one or more words, without special marks, without abbrevieations, optional one (1) colon (:) | |
description | memo | 65.535 | Description of the term | |
sex | text | 1 | f m n |
feminine masculine neutrum |
pl | text | 2 | pl <empty> |
term is commonly used in plural term in singular |
type | text | 5 | subst adj adv vi vt |
substantiv adjective adverb verb intransitive verb transitive |
deleted | boolean | yes/no | mark as deleted if not more used | |
en-AB | text | en-BE en-AE en-both |
if a term is only British or only American or both | |
.. |
use only words in a term
old:
- 86 | alignment correction(tape)
new:
- 86 | alignment correction by tape
use speakable terms
- and link to the ordered term
86 alignment correction by tape see "tape: alignment correction" 5292 tape: alignment correction (Description)
split Sub-Languages
(but be sure it is not only a dialect)
- en into en-BE and en-AE
- es into es-ESP and es-ARG
old:
1756 fair chart (Brit) (Description) 1757 fair sheet see "fair chart" 4793 smooth sheet (US) see "fair chart"
new en-BE:
1756 fair chart (Description) 1757 fair sheet see "fair chart" 4793 (nothing) see "fair chart"
new en-AE:
1756 smooth sheet (Description) 1757 (nothing) see "smooth sheet" 4793 (nothing) see "smooth sheet"
If you don't like to split Sub-Languages:
decide which is the Master Language
(which may be a political issue)
old:
1756 fair chart (Brit) (Description) 1757 fair sheet see "fair chart" 4793 smooth sheet (US) see "fair chart"
new:
1756 fair chart (Description) 1757 fair sheet see "fair chart" 4793 smooth sheet (AE) see "fair chart"
split Multi-Terms
old:
998 continental (or island) shelf
new:
998 a continental shelf 998 b island shelf
old:
398 base tape (or wire)
new:
398 a base tape 398 b base wire
old:
3104 marine nature reserve (U.S. marine sanctuary)
new en-BE:
3104 marine nature reserve
new en-AE:
3104 marine sanctuary
split Multi-Descriptions
old:
1015 control point (Description 1)... (Description 2)...
new:
1015 a control point (Description 1) 1015 b control point (Description 2)
Aberrations needs an own term
old:
1276 deep scattering layer (DSL)
new:
1276 a deep scattering layer 1276 b DSL see "deep scattering layer"
but how to deal with multiple meanings:
1276 c DSL see "Digital Subscriber Line"
use Singular
old:
1424 | divider(s) |
new:
1424 a | divider |
or split if necessary:
1424 a | divider |
1424 b | dividers |
Errors
English
Some small typos...
French
Some terms have double numbers:
1408-1411 | Aberration radiale |
Some terms have no number:
... | Abaque d’échelle |
Spanish
- Terms marked with "(Esp)" this is split between Term and Declaration.
- Terms marked with "(Arg)" this is split between Term and Declaration.