Text Categorization

Pre-Process: Jdid-Dc-NFactor

  • Description:
    This file contains the Jdid, Dc, and NFactor of every Journal Descriptor (JD) in the training set (MEDLINE).

  • Input:

  • Java Files & Algorithm:
    • GenerateJdDcNFactor.java
    • Read in Journal Descriptors from jds.txt
    • Calculate Dc (document count) for JDs
      • Read UI, JID, JDs from UiJidJds.${NUM}.txt
      • Update document count for all JDs
    • Calculate NFactor
      • Calculate total and average document count for all JDs
      • avg. = total Dc for all JDs / num of JDs
      • Assign NFactor for all JDs
        • NFactor = avg./jdDc (if jdDc > avg.)
        • NFactor = 1.0 (otherwise)
    • Print out Jdid-Dc-NFactor
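
The counting and NFactor steps above can be sketched roughly as follows. This is a minimal illustration and not the actual GenerateJdDcNFactor.java. The class name JdDcNFactorSketch and the methods countDocs and nFactors are hypothetical, and the input is assumed to be already parsed into one list of JDs per document, because the layout of jds.txt and UiJidJds.${NUM}.txt is not described here.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of the Dc and NFactor calculation; not the original source.
class JdDcNFactorSketch {

    // Counts, for each known JD, how many documents list it (Dc).
    // Assumption: docJds holds the JDs of one document per inner list.
    static Map<String, Integer> countDocs(List<String> jds, List<List<String>> docJds) {
        Map<String, Integer> dc = new LinkedHashMap<>();
        for (String jd : jds) {
            dc.put(jd, 0);
        }
        for (List<String> doc : docJds) {
            for (String jd : doc) {
                // JDs that are not in the descriptor list are ignored (an assumption).
                dc.computeIfPresent(jd, (key, count) -> count + 1);
            }
        }
        return dc;
    }

    // NFactor = avg / jdDc if jdDc > avg, otherwise 1.0,
    // where avg = total Dc for all JDs / number of JDs.
    static Map<String, Double> nFactors(Map<String, Integer> dc) {
        Map<String, Double> result = new LinkedHashMap<>();
        if (dc.isEmpty()) {
            return result;
        }
        long total = 0;
        for (int count : dc.values()) {
            total += count;
        }
        double avg = (double) total / dc.size();
        for (Map.Entry<String, Integer> entry : dc.entrySet()) {
            int jdDc = entry.getValue();
            result.put(entry.getKey(), jdDc > avg ? avg / jdDc : 1.0);
        }
        return result;
    }
}
```

For example, with JDs A, B, C and four documents listing [A, B], [A], [A], [C], the counts are A = 3, B = 1, C = 1, so avg. = 5/3. Only A exceeds the average and gets NFactor = (5/3)/3, about 0.556. B and C keep 1.0.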

  • Output file: