Because of a lapse in government funding, the information on this website may not be up to date, transactions submitted via the website may not be processed, and the agency may not be able to respond to inquiries until appropriations are enacted. The NIH Clinical Center (the research hospital of NIH) is open. For more details about its operating status, please visit cc.nih.gov. Updates regarding government operating status and resumption of normal operations can be found at OPM.gov.

Text Categorization

Pre-Process: Ui-Jid-Jds

  • Description:
    This file includes Ui-Jid-JD information in the train set (MEDLINE citations).

  • Input:
    • MEDLINE 2004: /nfsvol/indaux/MEDLINE_baseline/2004/medline04n${NUM}.txt
    • Date created (DA) from year: 1999, 2000, 2001
    • ${NUM} are file names of file include citations with DA in years of 1999, 2000, 2001

    • Jds.txt
    • Jid-Ta-Jds

  • Java Files & Algorithm:
    • GenerateFilesFromMedLine.java
    • Use JID to get the UI-Jid-Jds information
      • Read in all fields ( PMID, TI, AB, TA, JID, RN, MH) from MedLine citations if DA is within specified range
      • Read in JDs information through JID and Jid-Ta-Jds
      • Read in JDs information from JDs
      • Sent PMID, JID, JDs to uiJidJds${NUM}.txt

  • Output: