TY - JOUR
T1 - KUPS
T2 - Constructing datasets of interacting and non-interacting protein pairs with associated attributions
AU - Chen, Xue Wen
AU - Jeong, Jong Cheo
AU - Dermyer, Patrick
PY - 2011/1
Y1 - 2011/1
N2 - KUPS (The University of Kansas Proteomics Service) provides high-quality protein-protein interaction (PPI) data for researchers developing and evaluating computational models for predicting PPIs by allowing users to construct ready-to-use data sets of interacting protein pairs (IPPs), non-interacting protein pairs (NIPs) and associated features. Multiple filters and options allow the user to control the make-up of the IPPs and NIPs as well as the quality of the resultant data sets. Each data set is built from the overall database, which includes 185 446 IPPs and ~1.5 billion NIPs from five primary databases: IntAct, HPRD, MINT, UniProt and the Gene Ontology. The IPP set can be set to specific model organisms, interaction types and experimental evidence. The NIP set can be generated using four different strategies, which can alleviate biased estimation problems. Lastly, multiple features can be provided for all of the IPP and NIP pairs. Additionally, KUPS provides two benchmark data sets to help researchers compare their algorithms to existing approaches. KUPS is freely available at http://www.ittc.ku.edu/chenlab.
AB - KUPS (The University of Kansas Proteomics Service) provides high-quality protein-protein interaction (PPI) data for researchers developing and evaluating computational models for predicting PPIs by allowing users to construct ready-to-use data sets of interacting protein pairs (IPPs), non-interacting protein pairs (NIPs) and associated features. Multiple filters and options allow the user to control the make-up of the IPPs and NIPs as well as the quality of the resultant data sets. Each data set is built from the overall database, which includes 185 446 IPPs and ~1.5 billion NIPs from five primary databases: IntAct, HPRD, MINT, UniProt and the Gene Ontology. The IPP set can be set to specific model organisms, interaction types and experimental evidence. The NIP set can be generated using four different strategies, which can alleviate biased estimation problems. Lastly, multiple features can be provided for all of the IPP and NIP pairs. Additionally, KUPS provides two benchmark data sets to help researchers compare their algorithms to existing approaches. KUPS is freely available at http://www.ittc.ku.edu/chenlab.
UR - http://www.scopus.com/inward/record.url?scp=78651329116&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78651329116&partnerID=8YFLogxK
U2 - 10.1093/nar/gkq943
DO - 10.1093/nar/gkq943
M3 - Article
C2 - 20952400
AN - SCOPUS:78651329116
SN - 0305-1048
VL - 39
SP - D750-D754
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - SUPPL. 1
ER -