Selected Recent Publications

2014

  • Zhao, J., Huang, X. J. and Ye, Z. "Modeling Term Associations for Probabilistic Information Retrieval", ACM Transactions on Information Systems (TOIS). ACM Publisher. Vol 32, No. 2, Article 7, 47 pages. April 2014. ISSN: 1046-8188 and EISSN: 1558-2868 (PDF)
  • Ye, Z. and Huang, X. J. "A Learning to Rank Approach for Quality-Aware Pseudo Relevance Feedback" (32 pages), accepted by Journal of the American Society for Information Science and Technology (JASIST). Wiley InterScience Publisher, September 2014. ISSN (Printed): 1532-2882 and ISSN (Online): 1532-2890.
  • Feng, W., Zhang, Q., Hu, G. and Huang, X. J. "Mining Network Data for Intrusion Detection through Combining SVM with Ant Colony", Future Generation Computer Systems Journal (FGCS). ELSEVIER Publisher. Vol 37: 127-140. July 2014. ISSN: 0167-739X.
  • Ye, Z. and Huang, X. J. "A Simple Term Frequency Transformation Model for Effective Pseudo Relevance Feedback", Proceedings of the 37th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'14), Gold Coast, Australia. July 6-11, 2014 (PDF)
  • Zhao, J. and Huang, X. J. "An Enhanced Context-sensitive Proximity Model for Probabilistic Information Retrieval", Proceedings of the 37th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'14), Gold Coast, Australia. July 6-11, 2014 (PDF)

2013

  • Yin, X., Huang, X. J., Li, Z. and Zhou, X. "A Survival Modeling Approach to Biomedical Search Result Diversification Using Wikipedia", IEEE Transactions on Knowledge and Data Engineering (TKDE). IEEE Publisher. 25(6): 1201-1212, 2013. ISSN: 1041-4347.
  • Huang, X. J., Miao, J. and He, B. "High Performance Query Expansion Using Adaptive Co-training", Information Processing & Management: An International Journal (IPM). ELSEVIER Publisher. 49:441-453, 2013.
  • Zhao, J., Huang, X. J. and Hu, T. "BPLT+: A Bayesian-based Personalized Recommendation Model for Health Care" (18 pages), to appear in BMC Genomics. August 2013. ISSN: 1471-2164.
  • Daoud, M. and Huang, X. J. "Modelling Geographic, Temporal and Proximity Contexts for Improving Geo-Temporal Search", Journal of the American Society for Information Science and Technology (JASIST). Wiley InterScience Publisher, 64(1):190-212, 2013. ISSN (Printed): 1532-2882 and ISSN (Online): 1532-2890.
  • Hu, Q. and Huang, X. J. "Enhancing Genomics Information Retrieval Through Dimensional Analysis", Journal of Bioinformatics and Computational Biology (JBCB). 11(3):14 pages, 2013. ISSN (Print): 0219-7200 and ISSN (Online): 1757-6334.
  • Liu, Y., Yu, X., An, A. and Huang, X. J. "Riding the Tide of Sentiment Change: Sentiment Analysis with Evolving Online Reviews", World Wide Web Journal (WWWJ). Springer-Verlag Publisher. 16(4): 477-496, 2013. ISSN (Print): 1386-145X and ISSN (Online): 1573-1413.
  • Ayadi, H., Torjmen, M., Daoud, M., Benjemaa, M. and Huang, X. J. "Correlating Medical-dependent Query Features with Image Retrieval Models Using Association Rules" (10 pages), Proceedings of the 22nd International ACM Conference on Information and Knowledge Management (CIKM'13), San Francisco, CA, USA. October 27 - November 1, 2013 (PDF)
  • An, X. and Huang, X. J. "Boosting Novelty for Biomedical Information Retrieval Through Probabilistic Latent Semantic Analysis", Proceedings of the 36th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'13), Dublin, Ireland. 28 July - 1 August, 2013 (PDF)
  • Babashzadeh, A., Huang, X. J. and Daoud, M. "Exploiting Semantics for Improving Clinical Information Retrieval", Proceedings of the 36th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'13), Dublin, Ireland. 28 July - 1 August, 2013 (PDF)
  • Mahdabi, P., Gerani, S., Huang, X. J. and Crestani, F. "Leveraging Conceptual Lexicon: Query Disambiguation Using Proximity Information for Patent Retrieval", Proceedings of the 36th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'13), Dublin, Ireland. 28 July - 1 August, 2013 (PDF)

2012

  • Yu, X., Liu, Y., Huang, X. J. and An, A. "Mining Online Reviews for Predicting Sales Performance: A Case Study in the Movie Domain", IEEE Transactions on Knowledge and Data Engineering (TKDE). IEEE Publisher. 24(4): 720-734, 2012. ISSN: 1041-4347.
  • Daoud, M. and Huang, X. J. "Mining Query-Driven Contexts for Geographic and Temporal Search" (21 pages), accepted by International Journal of Geographical Information Science (IJGIS). Taylor & Francis Publisher, November 2012. ISSN (Printed): 1365-8816 and ISSN (Online): 1362-3087.
  • Ye, Z., Huang, X. J., He, B. and Lin, H. "Mining Multilingual Association Dictionary from Wikipedia for Cross-Language Information Retrieval" (28 pages), accepted by Journal of the American Society for Information Science and Technology (JASIST). Wiley InterScience Publisher, March 2012. ISSN (Printed): 1532-2882 and ISSN (Online): 1532-2890.
  • Hu, Q., Huang, X. J. and Hu, T. "Modeling and Mining Term Association for Improving Biomedical Information Retrieval Performance" (18 pages), to appear in BMC Bioinformatics. June 2012. ISSN: 1471-2105.
  • Yin, X., Huang, X. and Li, Z. "Re-Ranking with Context for High-Performance Biomedical Information Retrieval", International Journal of Data Mining and Bioinformatics. 6(2): 115-129, 2012. ISSN: 1748-5673.
  • Li, C. and Huang, X. J. "Spam Filtering Using Semantic Similarity Approach and Adaptive BPNN", Neurocomputing Journal. Elsevier Publisher. 92:88-97, 2012. ISSN: 0925-2312.
  • Chen, Y., Yin, X., Li, Z., Hu, T. and Huang, X. J. "A LDA-based Approach to Promoting Ranking Diversity for Genomics Information Retrieval" (17 pages), to appear in BMC Genomics. June 2012. ISSN: 1471-2164.
  • Zhao, J., Huang, X. J. and Hu, T. "A Bayesian-based Prediction Model for Personalized Medical Health Care", Proceedings of the 2012 IEEE International Conference on Bioinformatics & Biomedicine (BIBM'12). Philadelphia, USA, October 4-7, 2012.
  • Kasperowicz, D. and Huang, X. J. "Semantic Matching Models for Medical Information Retrieval: A Case Study", Proceedings of the 2012 Advances in Health Informatics Conference (AHIC'12), Toronto, ON, Canada, April 25-27, 2012.
  • Zhao, J. and Huang, X. J. "Rewarding Term Location Information to Enhance Probabilistic Information Retrieval", Proceedings of the 35th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'12), Portland, Oregon, August 12-16, 2012. (PDF)
  • Ye, Z., Huang, X. J. and Miao, J. "A Hybrid Model for Adhoc Information Retrieval", Proceedings of the 35th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'12), Portland, Oregon, August 12-16, 2012. (PDF)
  • Miao, J., Huang, X. J. and Ye, Z. "Proximity-based Rocchio's Model for Pseudo Relevance Feedback", Proceedings of the 35th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'12), Portland, Oregon, August 12-16, 2012. (PDF)

2011

  • Zhao, J., Huang, X. J. and He, B. "CRTER: Using Cross Terms to Enhance Probabilistic Information Retrieval" (full paper), Proceedings of the 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'11), Beijing, China, July 24-28, 2011. (PDF)
  • Zhou, X., Huang, X. J. and He, B. "Enhancing Ad-hoc Relevance Weighting Using Probability Density Estimation" (full paper), Proceedings of the 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'11), Beijing, China, July 24-28, 2011. (PDF)
  • He, B., Huang, X. J. and Zhou, X. "Modeling Term Proximity for Probabilistic Information Retrieval Models" (32 pages), accepted by Information Sciences Journal. Elsevier Publisher. March 2011. ISSN: 0020-0255.
  • Hu, Q., Huang, X. J. and Miao, J. "A Robust Approach to Optimizing Multi-Source Information for Enhancing Genomics Retrieval Performance" (18 pages), accepted by BMC Bioinformatics. January 2011. ISSN: 1471-2105.
  • Ye, Z., Huang, X. J. and Lin, H. "Finding a Good Query-Related Topic for Boosting Pseudo Relevance Feedback" (24 pages), accepted by Journal of the American Society for Information Science and Technology (JASIST). Wiley InterScience Publisher, January 2011. ISSN (Printed): 1532-2882 and ISSN (Online): 1532-2890.
  • Ye, Z., Huang, X. J. and Lin, H. "A Bayesian Network Approach to Context Sensitive Query Expansion", Proceedings of the 26th ACM Symposium on Applied Computing, TaiChung, Taiwan, March 21-24, 2011
  • Yin, X., Li, Z., Huang, X. J. and Hu, X. "Promoting Ranking Diversity for Genomics Search with A Relevance-Novelty Combined Model" (16 pages), accepted by BMC Bioinformatics. March 2011. ISSN: 1471-2105.
  • Lupu, M., Huang, X. J. and Zhu, J. "Evaluation of Chemical Information Retrieval Tools" (16 pages), Current Challenges in Patent Information Retrieval, The Information Retrieval Series. Springer-Verlag Publisher. Vol. 29. ISBN: 978-3-642-19230-2. April 2011.

2010

  • Yin, X., Huang, X. J. and Li, Z. "Mining and Modeling Linkage Information from Citation Context for Improving Biomedical Literature Retrieval" (32 pages), accepted by Information Processing & Management: An International Journal (IPM). ELSEVIER Publisher, March 2010.
  • Zhu, J., Huang, X. J., Song, D. and Rüger, S. "Integrating Multiple Document Features in Language Models for Expert Finding" (26 pages), Knowledge and Information Systems: An International Journal (KAIS). Springer-Verlag Publisher. ISSN (Printed): 0219-1377 and ISSN (Online): 0219-3116. Vol.23, No.1, 2010. pp.29-54. (online version)(PDF)
  • Ye, Z., Huang, X. J. and Lin, H. "Incorporating Rich Features to Boost Information Retrieval Performance: A SVM-regression Based Re-ranking Approach" (16 pages), accepted by Expert Systems with Applications: An International Journal. ELSEVIER Publisher, December 2010. ISSN: 0957-4174.
  • Liu, Y., Yu, X., Huang, X. and An, A. "Combining Integrated Sampling with SVM Ensembles for Learning from Imbalanced Datasets" (29 pages), accepted by Information Processing & Management: An International Journal (IPM). ELSEVIER Publisher, November 2010.
  • Hu, Q. and Huang, X. J. "Passage Extraction and Result Combination for Genomics Information Retrieval" (23 pages), Journal of Intelligent Information Systems (JIIS). Springer-Verlag Publisher. ISSN (Printed): 0925-9902 and ISSN (Online): 1573-7675. Vol.34, No.3, 2010. pp.249-274. (online version)
  • Zhao, J. and Huang, X. J. "CRTER: Using Cross Terms to Enhance Probabilistic Information Retrieval" (22 pages), Technical Report IR-201012-26, Information Retrieval and Knowledge Management Research Lab, York University, Toronto, Canada, December 26, 2010. (PDF)
  • Hu, Q., Huang, X. and Miao, J. "Exploring a Multi-Source Fusion Approach for Genomics Information Retrieval", Proceedings of the 2010 IEEE International Conference on Bioinformatics & Biomedicine, Hong Kong, China, December 18-21, 2010.
  • Hu, Q., Ye, Z. and Huang, X. J. "Enhancing Content-Based Image Retrieval Using Machine Learning Techniques", Proceedings of the 2010 International Conference on Active Media Technology (AMT'10), Toronto, Canada, August 28-30, 2010.
  • Hu, Q. and Huang, X. J. "Genomics Information Retrieval Using a Bayesian Model for Learning and Re-ranking", Proceedings of the 2010 ACM International Conference on Bioinformatics and Computational Biology, New York, USA, August 2-4, 2010.
  • Huang, X., An, A. and Hu, Q. "Medical Search and Classification Tools for Recommendation", Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Geneva, Switzerland, July 19-23, 2010.
  • Yin, X. and Huang, X. "A Survival Modeling Approach to Biomedical Search Result Diversification Using Wikipedia", Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Geneva, Switzerland, July 19-23, 2010.
  • Yu, X., Liu, Y., Huang, X. and An, A. "S-PLSA+: Adaptive Sentiment Analysis with Application to Sales", Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Geneva, Switzerland, July 19-23, 2010.
  • Yu, X., Liu, Y., Huang, X. and An, A. "A Quality-Aware Model for Sales Prediction Using Reviews", Proceedings of the 19th International World Wide Web Conference (WWW'10), Raleigh, North Carolina, USA, April 26-30, 2010.
  • Yin, X., Huang, X. and Li, Z. "Promoting Ranking Diversity for Biomedical Information Retrieval Using Wikipedia" (12 pages), Proceedings of the 32nd European Conference on Information Retrieval (ECIR'10), Milton Keynes, UK, 28-31 March, 2010 (The best paper award).
  • Huang, X. J., Jones, G., Koudas, N., Wu, X. and Collins-Thompson, K. Proceedings of the 19th ACM International Conference on Information and Knowledge Management, Toronto, Canada, October 26-30, 2010. ACM Publisher, 1982 pages. ISBN: 978-1-4503-0099-5.
  • Yao, Y., Sun, R., Poggio, T., Liu, J., Zhong, N. and Huang, X. J. Brain Informatics, Proceedings of BI 2010. Lecture Notes in Computer Science 6334, Springer 2010, ISBN 978-3-642-15313-6. 436 pages.
  • An, X., Huang, X. and Cercone, N. "The Optimal IR in Genomics: How Far Away?", Proceedings of the 11th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing'10), Iasi, Romania, 21-27 March, 2010.
  • Zhao, J., Huang, X. and Ye, Z. "York University at TREC 2010: Chemical Track", Proceedings of the 19th Text REtrieval Conference (TREC-19), NIST Special Publication, Gaithersburg, MD, USA, November 16-19, 2010.
  • Lupu, M., Tait, J., Huang, X. and Zhu, J. "Overview of the TREC 2010 Chemical IR Track", Proceedings of the 19th Text REtrieval Conference (TREC-19), NIST Special Publication, Gaithersburg, MD, USA, November 16-19, 2010.
  • Ye, Z. and Huang, X. J. "Exploring Social Annotation Tags to Enhance Information Retrieval Performance", Proceedings of the 2010 International Conference on Active Media Technology (AMT'10), Toronto, Canada, August 28-30, 2010.
  • Ye, Z., Huang, X. J., Hu, Q. and Lin, H. "An Integrated Approach for Medical Image Retrieval through Combining Textual and Visual Features", Multilingual Information Access Evaluation II - Multimedia Experiments. Springer-Verlag Publisher. Lecture Notes in Computer Science, Vol. 6242, pp. 195-202. ISBN: 978-3-642-15750-9. January 2010.
  • Miao, J., Huang, X. J. and Hu, Q. "York University at TRECVID 2010: Video Retrieval Evaluation", Proceedings of 2010 TRECVID Evaluation (TRECVID 2010), NIST Special Publication, Gaithersburg, MD, USA, November 15-17, 2010.

2009

  • Huang, X. and Hu, Q. "A Bayesian Learning Approach to Promoting Diversity in Ranking for Biomedical Information Retrieval", Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'09), Boston, USA, July 19-23, 2009. (online version)(PDF)
  • Ye, Z., Huang, X. and Lin, H. "A Graph-based Approach to Mining Multilingual Word Associations from Wikipedia", Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'09), Boston, USA, July 19-23, 2009.
  • Ye, Z., Huang, X. and Lin, H. "Towards A Better Performance for Medical Image Retrieval Using An Integrated Approach", Proceedings of the 10th Workshop of the Cross-Language Evaluation Forum (CLEF 2009), Corfu, Greece, September 30 - October 2, 2009.
  • Lupu, M., Huang, Jimmy X., Zhu, J. and Tait, J. "TREC-CHEM: Large Scale Chemical Information Retrieval Evaluation at TREC", SIGIR Forum 42(1): 63-77. December 2009.
  • Yin, X., Huang, X., Hu, Q. and Li, Z. "Boosting Biomedical Information Retrieval Performance through Citation Graph: An Empirical Study", Proceedings of the 13th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'09), Bangkok, Thailand, April 27-30, 2009.
  • Yin, X., Huang, X. and Li, Z. "Towards A Better Ranking for Biomedical Information Retrieval Using Context", Proceedings of the 2009 IEEE International Conference on Bioinformatics & Biomedicine, Washington D.C., USA, November 1-4, 2009.
  • Yin, X., Huang, X. and Li, Z. "BioCLink: A Probabilistic Approach for Improving Genomics Search with Citation Links", Proceedings of the 2009 IEEE International Conference on Bioinformatics & Biomedicine, Washington D.C., USA, November 1-4, 2009.
  • An, A., Wan, Q., Zhao, J. and Huang, X. "Diverging Patterns: Discovering Significant Frequency Change Dissimilarities in Large Databases", Proceedings of the 18th International ACM Conference on Information and Knowledge Management (CIKM'09), Hong Kong, China, November 2-6, 2009.
  • Rohian, H., An, A., Zhao, J. and Huang, X. "Discovering Temporal Associations among Significant Changes in Gene Expression", Proceedings of the 2009 IEEE International Conference on Bioinformatics & Biomedicine, Washington D.C., USA, November 1-4, 2009.
  • Zhao, J., Huang, X., Ye, Z. and Zhu, J. "York University at TREC 2009: Chemical Track", Proceedings of the 18th Text REtrieval Conference (TREC-18), NIST Special Publication, Gaithersburg, MD, USA, November 18-20, 2009.
  • Ye, Z., Huang, X., He, B. and Lin, H. "York University at TREC 2009: Relevance Feedback Track", Proceedings of the 18th Text REtrieval Conference (TREC-18), NIST Special Publication, Gaithersburg, MD, USA, November 18-20, 2009.
  • Lupu, M., Piroi, F., Huang, X., Zhu, J. and Tait, J. "Overview of the TREC 2009 Chemical IR Track" (24 pages), Proceedings of the 18th Text REtrieval Conference (TREC-18), NIST Special Publication, Gaithersburg, MD, USA, November 18-20, 2009.
  • Huang, X., An, A., Hu, Q. and Tu, K. "Medical Text Analytics Tools for Search and Classification", Proceedings of the 2009 Annual International Conference on Information Technology and Communications in Health (ITCH'09), Laurel Point Inn, Victoria, BC, Canada, February 19-22, 2009.
  • Andreopoulos, B., Huang, X., An, A., Labudde, D. and Hu, Q. "Promoting Diversity in Top Hits for Biomedical Passage Retrieval" (22 pages), Advances in Data Management, Springer-Verlag Publisher. ISSN (Printed): 1860-949X and ISSN (Online): 1860-9503. In press. 2009.

2008

  • Liu, Y., Huang, X., An, A. and Yu, X. "Modeling and Predicting the Helpfulness of Online Reviews", Proceedings of the 2008 IEEE International Conference on Data Mining (ICDM'08), Pisa, Italy, December 15-19, 2008.
  • Liu, Y., Yu, X., Huang, X. and An, A. The Predictive Power of Sentiments: An Empirical Study (15 pages). In Longbing Cao, Philip S. Yu, Chengqi Zhang, Huaifeng Zhang (Editors), Domain Driven Data Mining: Domain Problems and Applications, Springer-Verlag Publisher. December 2008.
  • Kovacevic, M. and Huang, X. "York University at TREC 2008: Blog Track", Proceedings of the 17th Text REtrieval Conference (TREC-17), NIST Special Publication, Gaithersburg, MD, USA, November 2008.
  • Zhu, J., Song, D., Rüger, S. and Huang, X. "Modeling Document Features for Expert Finding", Proceedings of the 17th International ACM Conference on Information and Knowledge Management (CIKM'08), Napa Valley, CA, USA, October 26-30, 2008.
  • Andreopoulos, B., An, A., Huang, X. and Labudde, D. "Integration of Genomic, Proteomic and Biomedical Information on the Semantic Web", Proceedings of 2nd International Workshop on Conceptual Modelling for Life Sciences Applications (CMLSA 2008), Barcelona, Spain. October 20-23, 2008.
  • Liu, Y., Huang, X., An, A. and Yu, X. HelpMeter: A Nonlinear Model for Predicting the Helpfulness of Online Reviews. Proceedings of the 2008 IEEE/ACM International Conference on Web Intelligence, Sydney, Australia, December 9-12, 2008.
  • Huang, X., An, A. and Liu, Y. Web Usage Mining with Web Logs. In John Wang (Ed.), Encyclopedia of Data Warehousing and Mining, Information Science Publishing (an imprint of Idea Group Inc.), August 2008.
  • Hu, Q. and Huang, X. A Reranking Model for Genomics Aspect Search. Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'08), Singapore, July 20-24, 2008.
  • Hu, Q. and Huang, X. A Dynamic Window Based Passage Extraction Algorithm for Genomic Information Retrieval. Proceedings of the 17th International Symposium on Methodologies for Intelligent Systems (ISMIS'08), Toronto, Canada. May 20-23, 2008.

2007

  • Yang Liu, Xiangji Huang and Aijun An. "Personalized Recommendation with Adaptive Mixture of Markov Models", Journal of the American Society for Information Science and Technology (JASIST). Wiley InterScience Publisher. ISSN (Printed): 1532-2882 and ISSN (Online): 1532-2890. Vol.58, No.12, 2007. pp.1851-1870. (PDF)
  • Fuchun Peng and Xiangji Huang. "Machine Learning Approaches to Automatic Text Classification for Asian Languages", Journal of Documentation. Emerald Group Publishing Limited, U.K. ISSN: 0022-0418. Vol.63, No.3, 2007. pp.378-397. (PDF)
  • Huang, X., Sotoudeh-Hosseini, D., Rohian, H. and An, X. York University at TREC 2007: Genomics Track, Proceedings of the 16th Text REtrieval Conference (TREC-16), NIST Special Publication, Gaithersburg, MD, November 2007.
  • Fan, Y. and Huang, X. York University at TREC 2006: Enterprise Document Search, Proceedings of the Sixteenth Text REtrieval Conference (TREC-16), NIST Special Publication, Gaithersburg, MD, November 2007.
  • Y. Liu, X. Huang, A. An and X. Yu. "ARSA: A Sentiment-Aware Model for Predicting Sales Performance Using Blogs", Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'07), Amsterdam, July 23-27, 2007. (PDF)
  • Yao, Y., Zeng, Y., Zhong, N. and Huang, X. Knowledge Retrieval, Proceedings of the 2007 IEEE/ACM International Conference on Web Intelligence, Silicon Valley, November 2-5, 2007.
  • Bill Andreopoulos, Aijun An, Xiangji Huang and Xiaogang Wang. "Finding Molecular Complexes through Multiple Layer Clustering of Protein Interaction Networks" (22 pages), accepted by International Journal of Bioinformatics Research and Applications (IJBRA). Inderscience Publisher. ISSN (Printed): 1744-5485 and ISSN (Online): 1744-5493. Vol.3, No.1, 2007. pp.65-85.

2006

  • Xiangji Huang, Qingsong Yao and Aijun An. "Applying Language Modeling to Session Identification from Database Trace Logs" (32 pages), Knowledge and Information Systems: An International Journal (KAIS). Springer-Verlag Publisher. ISSN (Printed): 0219-1377 and ISSN (Online): 0219-3116. Vol.10, No.4, 2006. pp.473-504. (online version)(PDF)
  • Xiangji Huang. "Comparison of Interestingness Measures for Web Usage Mining: An Empirical Study" (32 pages), accepted by International Journal of Information Technology and Decision Making. World Scientific Publishing Co. ISSN: 0219-6220. 2006.
  • Aijun An, Shakil Khan and Xiangji Huang. "Hierarchical Grouping of Association Rules and Its Application to a Real-World Domain" (30 pages), accepted by International Journal of Systems Science (IJSS), Special Issue on Advances in Data Mining and Its Applications. Taylor & Francis Group Publisher. ISSN (Printed): 0020-7721 and ISSN (Online): 1464-5319. To appear in 2006.
  • Xiangji Huang, Miao Wen, Aijun An and YanRui Huang. "A Platform for Okapi-Based Contextual Information Retrieval", Proceedings of ACM SIGIR 2006, Seattle, Washington, August 6-11, 2006.
  • Ming Zhong and Xiangji Huang. "Concept-Based Biomedical Text Retrieval", Proceedings of ACM SIGIR 2006, Seattle, Washington, August 6-11, 2006.
  • Wen, M. and Huang, X. York University at TREC 2006: Legal Track, Proceedings of the Fifteenth Text REtrieval Conference (TREC-15), NIST Special Publication, Gaithersburg, MD, November 2006.
  • Yang Liu, Aijun An and Xiangji Huang. "Boosting Prediction Accuracy on Imbalanced Datasets with SVM Ensembles", Proceedings of 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2006), Singapore, April 9-12, 2006. Lecture Notes in Computer Science (LNCS), Springer-Verlag Publisher.
  • Miao Wen and Xiangji Huang. "A Multi-level Searching and Re-ranking Framework for Information Retrieval", Proceedings of 2006 IEEE International Conference on Granular Computing, Atlanta, USA, May 10-12, 2006.
  • Xiangji Huang, YanRui Huang, Miao Wen, Aijun An, Yang Liu and Josiah Poon. "Applying Data Mining to Pseudo-Relevance Feedback for High Performance Text Retrieval", Proceedings of the 2006 IEEE International Conference on Data Mining (ICDM'06), Hong Kong, December 18-22, 2006. (PDF)

2005

  • Xiangji Huang. "Clustering Analysis and Algorithms". In John Wang (Ed.), Encyclopedia of Data Warehousing and Mining, Information Science Publishing (an imprint of Idea Group Inc.), ISBN: 2-59140-557-2. 2005.
  • Xiangji Huang, YanRui Huang and Miao Wen. "A Dual Index Model for Contextual Information Retrieval", Proceedings of ACM SIGIR 2005, Salvador, Brazil, August 15-19, 2005.
  • Xiangji Huang and YanRui Huang. "Using Contextual Information to Improve Retrieval Performance", Proceedings of 2005 IEEE International Conference on Granular Computing, Beijing, China, July 25-27, 2005. ISBN: 0-7803-9017-2 and IEEE Catalog Number: 05EX1036.
  • Luo Si, T. Kanungo and Xiangji Huang. "Boosting Performance of Bio-Entity Recognition by Combining Results from Multiple Systems", Proceedings of the 5th ACM SIGKDD Workshop on Data Mining in Bioinformatics, Chicago, USA, August 21, 2005. (PDF)
  • Xiangji Huang. "Incorporating Contextual Retrieval into Okapi", Proceedings of ACM SIGIR Workshop on IR in Context (IRiX'05), Salvador, Brazil, August 19, 2005. ISBN: 87-7415-290-4
  • Ying Zou, Aijun An and Xiangji Huang. "Evaluation and Automatic Selection of Methods for Handling Missing Data", Proceedings of 2005 IEEE International Conference on Granular Computing, Beijing, China, July 25-27, 2005. ISBN: 0-7803-9017-2 and IEEE Catalog Number: 05EX1036.
  • Qingsong Yao, Xiangji Huang and Aijun An. "A Machine Learning Approach to Identify Database Sessions Using Unlabeled Data", Proceedings of the 7th International Conference on Data Warehousing and Knowledge Discovery, Copenhagen, Denmark, August 22-26, 2005. Lecture Notes in Computer Science (LNCS) 3589: 254-264. Springer-Verlag Publisher.
  • Qingsong Yao, Aijun An and Xiangji Huang. "A Distance-based Algorithm for Clustering Database User Sessions", Proceedings of the 15th International Symposium on Methodologies for Intelligent Systems (ISMIS 2005), Saratoga Springs, New York, May 25-28, 2005. Lecture Notes in Computer Science (LNCS) 3488: 562-572. Springer-Verlag Publisher.
  • Qingsong Yao, Aijun An and Xiangji Huang. "Finding and Analyzing Database User Sessions", Proceedings of the 10th International Conference on Database Systems for Advanced Applications (DASFAA 2005), Beijing, China, April 18-20, 2005. Lecture Notes in Computer Science (LNCS) 3453: 851-862. Springer-Verlag Publisher. (PDF)
  • Miao Wen, Xiangji Huang, Aijun An, and YanRui Huang. "York University at TREC 2005: HARD Track", Proceedings of the Fourteenth Text REtrieval Conference (TREC-14), NIST Special Publication, Gaithersburg, MD, November 2005.
  • Wei Cao, Aijun An, and Xiangji Huang. "York University at TREC 2005: SPAM Track", Proceedings of the Fourteenth Text REtrieval Conference (TREC-14), NIST Special Publication, Gaithersburg, MD, November 2005.
  • Mladen Kovacevic and Xiangji Huang. "York University at TREC 2005: Terabyte Track", Proceedings of the Fourteenth Text REtrieval Conference (TREC-14), NIST Special Publication, Gaithersburg, MD, November 2005.
  • Xiangji Huang, Ming Zhong and Luo Si. "York University at TREC 2005: Genomics Track", Proceedings of the Fourteenth Text REtrieval Conference (TREC-14), NIST Special Publication, Gaithersburg, MD, November 2005. (PDF)

2004

  • Xiangji Huang, Fuchun Peng, Aijun An and Dale Schuurmans. "Dynamic Web Log Session Identification with Statistical Language Models", Journal of the American Society for Information Science and Technology (JASIST). Wiley InterScience Publisher. ISSN (Printed): 1532-2882 and ISSN (Online): 1532-2890. Vol.55, No.14, 2004. pp.1290-1303. (PDF)
  • Huang, X., Huang, Y., Wen, M. and Zhong, M. York University at TREC 2004: Genomics and HARD Tracks (16 pages), In E.M. Voorhees and Lori P. Buckland, editors, Proceedings of the Thirteenth Text REtrieval Conference (TREC-13), National Institute of Standards and Technology (NIST), NIST Special Publication 500-261, Gaithersburg, MD, November 2004.
  • Aijun An, Y. Huang, Xiangji Huang and Nick Cercone. "Feature Selection with Rough Sets for Web Page Classification", LNCS Transactions on Rough Sets, Springer Verlag, Vol.2, 2004. pp.1-13. (PDF)
  • Yang Liu, Xiangji Huang and Aijun An. "Clustering Web Surfers with Probabilistic Models in a Real Application", Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence (WI'04), Beijing, China, September 20-24, 2004. 851-862.
  • Yang Liu, Aijun An and Xiangji Huang. "Web Surfing Recommendations in a Real Application", Proceedings of the ECML/PKDD'04 workshop on Statistical Approaches for Web Mining, Pisa, Italy, September 20-24, 2004. 2-13.

2003

  • Xiangji Huang, Fuchun Peng, Dale Schuurmans, Nick Cercone and Stephen Robertson. "Applying Machine Learning to Text Segmentation for Information Retrieval", Information Retrieval, Vol 6, Issue 4, pp.333-362, 2003. (PDF)
  • Aijun An, Shakil Khan and Xiangji Huang. "Objective and Subjective Algorithms for Grouping Association Rules", Proceedings of the Third IEEE International Conference on Data Mining (ICDM 2003), Florida, USA, November 19-22, 2003.
  • Fuchun Peng, Xiangji Huang, Dale Schuurmans and Shaojun Wang. "Text Classification in Asian Languages without Word Segmentation", Proceedings of the Sixth International Workshop on Information Retrieval with Asian Languages (IRAL 2003), Sapporo, Japan, July 2003.
  • Xiangji Huang, Fuchun Peng, Aijun An and Dale Schuurmans. "Session Boundary Detection for Association Rule Learning Using N-Gram Language Models", Proceedings of the Sixteenth Canadian Conference on Artificial Intelligence (CAI-03), Halifax, Canada, June 11-13, 2003. Lecture Notes in Computer Science (LNCS) 2671: 237-251. Springer-Verlag Publisher.

2002

  • Xiangji Huang, Aijun An, Nick Cercone and Gary Promhouse. "Discovery of Interesting Association Rules from Livelink Web Log data", Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM'02), Maebashi TERRSA, Maebashi City, Japan, December 9-12, 2002. (PDF)
  • Xiangji Huang, Aijun An, Nick Cercone and Gary Promhouse. "Comparison of Interestingness Functions for Learning Web Usage Patterns", Proceedings of the 11th International ACM Conference on Information and Knowledge Management (CIKM'02), McLean, VA, USA, November 4-9, 2002. (PDF)
  • Xiangji Huang, Fuchun Peng, Dale Schuurmans and Nick Cercone. "Waterloo at NTCIR-3: Comparing and Analysing Different Text Extraction Methods", Proceedings of the Third NTCIR Workshop on Evaluation of Information Retrieval, Q&A, and Summarization (NTCIR'02), ISBN: 4-86049-016-9, National Institute of Informatics, Tokyo, Japan, October 8-10, 2002.
  • Fuchun Peng, Xiangji Huang, Dale Schuurmans and Nick Cercone. "Investigating the Relationship of Word Segmentation Performance and Retrieval Performance in Chinese IR", Proceedings of the 19th Biennial International Conference on Computational Linguistics (COLING'02), Taipei, Taiwan, August 24-September 1, 2002.
  • Fuchun Peng, Xiangji Huang, Dale Schuurmans, Nick Cercone and Stephen Robertson. "Using Self-Supervised Word Segmentation in Chinese Information Retrieval", Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'02), Tampere, Finland, August 11-15, 2002.

2001

  • Xiangji Huang and Stephen Robertson. "Comparisons of Probabilistic Compound Unit Weighting Methods", Proceedings of the 2001 IEEE ICDM Workshop on Text Mining, San Jose, USA, November 29-December 2, 2001. 1-15.
  • Aijun An, Nick Cercone and Xiangji Huang. "A Case Study for Learning from Imbalanced Data Sets", Proceedings of the 14th Canadian Conference on Artificial Intelligence (CAI-01), Ottawa, Canada, June 7-9, 2001. Lecture Notes in Computer Science (LNCS) 2056: 1-15. Springer-Verlag Publisher. (PDF)

2000

  • Xiangji Huang, Stephen E. Robertson, Nick Cercone and Aijun An. "Probability-based Chinese Text Processing and Retrieval", Computational Intelligence: An International Journal (CI), Vol.16, No.4, 2000. pp.552-569. (PDF)
  • Xiangji Huang and Stephen E. Robertson. "A Probabilistic Approach to Chinese Information Retrieval: Theory and Experiments", Proceedings of the 22nd Annual BCS-IRSG Colloquium on Information Retrieval Research, Cambridge, England, April 2000. 178-193.

1998

  • Huang, X. and Robertson, S.E. Okapi Chinese Text Retrieval Experiments at TREC-6. In E.M. Voorhees and D. K. Harman, editors, The Sixth Text REtrieval Conference (TREC-6), National Institute of Standards and Technology (NIST), NIST Special Publication 500-240, Gaithersburg, MD, November 1998. 137-142.

1997

  • Beaulieu, M.M., Gatford, M., Huang, X., Robertson, S.E., Walker, S. and Williams, P. Okapi at TREC-5. In E.M. Voorhees and D.K. Harman, editors, Proceedings of the Fifth Text REtrieval Conference (TREC-5), National Institute of Standards and Technology (NIST), NIST Special Publication 500-238, Gaithersburg, MD, November 1997. 143-166. (PS.GZ)
  • Xiangji Huang and Stephen E. Robertson. "Application of Probabilistic Models to Chinese Text Retrieval", Journal of Documentation (JDoc), Vol.53, No.1, 1997. pp.74-79.
