Because of a lapse in government funding, the information on this website may not be up to date, transactions submitted via the website may not be processed, and the agency may not be able to respond to inquiries until appropriations are enacted. The NIH Clinical Center (the research hospital of NIH) is open. For more details about its operating status, please visit cc.nih.gov. Updates regarding government operating status and resumption of normal operations can be found at OPM.gov.
Derivations Procedures - orgD
Retrieved and verify dPairs from original Lexical Tools DM.DB and add them to derivation table. This should be done after nomD, prefixD, suffixD, and zeroD. All orgD with EUIs (in Lexicon EUIs) and valid dPairs and not included in our system are added to our final derivation table. We don't expect too many valid dPair from orgD because only new LexRecords in orgD will be added. However, this procedure requires many manually updates (in Steps: 2,5,6,7,8,81)
I. Directory:
II. Input Files (./data/${YEAR}/dataOrg/):
shell> ${ORG_D}/bin/GetOrgD ${YEAR}
0
III. Final files for allD (release)
IV. Summary of GetOrgD
Step | Description and Program | Input | Output | Notes | |||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 |
| See section II. | See section II. |
| 0 | ||||||||||||||||||||||||||||||||||||||||||
1 |
|
|
|
| 1 | ||||||||||||||||||||||||||||||||||||||||||
2 |
|
|
|
| 2 | ||||||||||||||||||||||||||||||||||||||||||
3 |
|
|
|
| 3 | ||||||||||||||||||||||||||||||||||||||||||
4 |
|
|
|
| 4 | ||||||||||||||||||||||||||||||||||||||||||
5 |
|
|
|
|
IV. Processes details:
shell>cd ${DERIVATION}/0.orgD/bin
shell>GetOrgD ${YEAR}
1) Combine Original Lvg Fact dPairs from 5 files
Combine above five files:
=> Generate ./data/${YEAR}/data/orgD.raw.data
2) Reformat: remove comments, uSort, empty line
Reformat: remove comments, uSort, empty lines:
=> Generate ./data/${YEAR}/data/orgD.yes.data
=> Remove 1st (empty) line in ./data/${YEAR}/data/orgD.yes.data
3) Add EUI to orgD.yes.data.final (use E0000000 for no EUI)
Add EUI to dPairs in orgD.yes.data, use E0000000 if no EUI found
=> Prepare:
=> Generate:
4) Add dType of orgD.yes.data.final.yesEui
Add dType to orgD.yes.data.final.yesEui to zeroD, suffixD, and prefixD:
generate:
5) Add tag to prefixD: orgD.yes.data.final.yesEui.type.P
Generates:
6) Add tag to suffixD: orgD.yes.data.final.yesEui.type.S
Generates:
7) Add tag to zeroD: orgD.yes.data.final.yesEui.type.Z
Generates:
Ideally, all orgD should be automatically generated by our new derivations generation processes by adding: more prefix (for prefixD) and SD candidate rules (for suffixD). No new zeroD should be found because our system should cover all possible zeroD (please notes that acronyms or abbreviations can't be zeroD). In 2014 release, we manually verify and add orgD into derivational table. Please see the reports on orgD, 2014 for detail.
Please refer to derivation design documents in Lexical Tools for details.