<?php
echo "Development of a text mining tool which will search in all the open access articles of Pubmed and return snippets for every keyword.\r\nThe keywords are listed below\r\nThe software must check on the free articles of Pubmed (which can be found in this URL : http://www.ncbi.nlm.nih.gov/pmc/tools/ftp/ in XML format) and extract for every article that contain the word CKD (Chronic Kidney Disease) and at least one of the keywords, the snippets that are related to every keyword.\r\nEvery list of keywords and snippets must be linked to one PMID (Pubmed ID)\r\nThe snippets must be included in a database in which the excell cells will be linked to.\r\n\r\nPMID\r\nLiteratureTargetSequence\r\nSOURCEdb\r\npub-ID\r\nAssociated disease\r\nBiomarker\r\nGeneSymbol\r\nProtein/Gene/Metabolite Name\r\nspecies\r\ndisease\r\ntype of controls\r\ndisease induction\r\ntissue\r\nsubcell\r\nN (pool; number of pooled individuals from same disease state)\r\n#gel slices\r\nquant method\r\n# of Identified Peptides\r\n# of Unique Peptides\r\nIdentMeth\r\nIdentScore\r\nSequence Coverage (%)\r\nemPAI\r\nTSC\r\nm/z\r\nNSAF\r\nPHR\r\nQUESTIONABLE/CONTAMINANT\r\nID quality\r\nZ score\r\nConfidence %\r\npvtest\r\npvalue\r\nSpot N°\r\nSubcellular Location\r\nProtMod\r\nRegulation in disease\r\nRatio (disease/control) +/- SEM\r\nfrequency found in disease %\r\nfrequency found in healthy %\r\nSensitivity\r\nSpecificity\r\nValidation method\r\nValidation case disease\r\nValidation control disease\r\nValidation case N\r\nValidation control N\r\nValidation diagnostic accuracy\r\nValidation prognostic association\r\n";
?>
Development of a text mining tool which will search in all the open access articles of Pubmed and return snippets for every keyword.
The keywords are listed below
The software must check on the free articles of Pubmed (which can be found in this URL : http://www.ncbi.nlm.nih.gov/pmc/tools/ftp/ in XML format) and extract for every article that contain the word CKD (Chronic Kidney Disease) and at least one of the keywords, the snippets that are related to every keyword.
Every list of keywords and snippets must be linked to one PMID (Pubmed ID)
The snippets must be included in a database in which the excell cells will be linked to.
PMID
LiteratureTargetSequence
SOURCEdb
pub-ID
Associated disease
Biomarker
GeneSymbol
Protein/Gene/Metabolite Name
species
disease
type of controls
disease induction
tissue
subcell
N (pool; number of pooled individuals from same disease state)
#gel slices
quant method
# of Identified Peptides
# of Unique Peptides
IdentMeth
IdentScore
Sequence Coverage (%)
emPAI
TSC
m/z
NSAF
PHR
QUESTIONABLE/CONTAMINANT
ID quality
Z score
Confidence %
pvtest
pvalue
Spot N°
Subcellular Location
ProtMod
Regulation in disease
Ratio (disease/control) +/- SEM
frequency found in disease %
frequency found in healthy %
Sensitivity
Specificity
Validation method
Validation case disease
Validation control disease
Validation case N
Validation control N
Validation diagnostic accuracy
Validation prognostic association