Programs

Courses

Textbooks by Semester

Course Web Pages - Fall 2012 - LIBR 246-06/15 Greensheet - Bibliography

LIBR 246
Text/Data/Web Mining for LIS
Bibliography

Dr. Geoffrey Z. Liu
E-mail


Course Links
Course Calendar
Group Project
Individual Assignment
LIBR 246 Resources
Online Resources
Bibliography
Software Tools
Resources
D2L
D2L Tutorial
SLIS e-Bookstore

Monographs

Berry, M. W., & Kogan, J. (Eds.). (2010). Text mining: Applications and theory. UK: John Wiley.

Calishain, T. and Dornfest, R. (2003). Google hacks: 100 industrial-strength tips & tools. Sebastopool, CA : O’Reilly & Associates.

Chakrabarti, S. (2003). Mining the Web: Discovering knowledge from hypertext data. San Francisco, CA : Morgan Kaufmann.

Do Prado, H. A., & Ferneda, E. (Eds.) (2007). Emerging technologies of text mining: Techniques and applications (premier reference source). Idea Group References.

Dunham. M. H. (2003). Data mining: Introductory and advanced topics. Upper Saddle River, NJ : Prentice Hall.

Feedman, S., & Sanger, J. (2007). The text mining handbook: Advanced approaches in analyzing unstructured data. Cambridge: Cambridge University Press.

Fleisher, C. S. and Bensoussan, B. E. (2003). Strategic and competitive analysis. Upper Saddle River, NJ : Prentice Hall.

Fuld, L. (1995). The new competitor intelligence. New York : Wiley.

Han, J. , & Kamber, M. (2006). Data mining: Concepts and techniques. 2nd ed. San Francisco, CA: Elsevier.

Larson, E. (1992). The naked customer: How our private lives become public commodities. 1st ed. Henry Holt.

Liu, B. (2007). Web data mining: Exploring hyperlinks, contents, and usage data (data-centric systems and applications). 1st ed. Springer.

Markov, Z., & Larose, D.T. (2007). Data mining the web: Uncovering patterns in web content, structure, and usage. Hoboken, NJ. : John Willey & Sons.

Shmueli, G., Patel, N.R., & Bruce, P. (2010). Data mining for business intelligence: Concepts, techniques, and applications in Microsoft Office Excel with XLMiner. 2nd ed. Hoboken, NJ: John Wiley & Sons.

Srivastava, A., & Sahami, M. (Eds). (2009). Text mining: Classification, clustering, and applications. (Chapman & Hall/CRC Data Mining and Knowledge Discovery Series). Boca Raton, FL: Chapman and Hall/CRC.

Torgo, L. (2010).  Data mining with R: Learning with case studies. (Chapman & Hall/CRC data mining and knowledge discovery series). Chapman and Hall/CRC.

Wang, J. (Ed.) (2003). Data mining: Opportunities and Challenges. IRM Publishers.

Weiss, S. M., Indurkhya, N., & Zhang, T. (2010). Fundamentals of predictive text mining. 1st ed. Springger.

Zanasi, A. (2007). Text mining and its applicatoins to intelligence, CRM and knowledge management: Advances in management information. WIT Press.

Conference/Journal Articles & Chapters

General

Kroeze, J. H., Matthee, M. C., & Bothma, T. J. D. (2004). Differentiating between data-mining and text-mining terminology. South African Journal of Information Management (SAJIM), 6(4).

Data Mining

Baayen, R.H. (2005). Data mining at the intersection of psychology and linguistics. In Cutler, A. (Ed.), Twenty-first century psycholinguistics: Four cornerstones (pp. 69-83). Mahwah, NJ: Lawrence Erlbaum.

Freitas, A.A. (2000). Understanding the crucial differences between classification and discovery of association rules. ACM SIGKDD Explorations, 2(1), 65-69.

Goede, R. & De Villiers, C. (2004). A pattern-matching method for the analysis of case study data in a study on systems thinking motivations of data warehouse professionals. Hierdie artikel word tans oorweeg vir die proceedings CPTS2004 van 2004 se konf.

Schoeman, J., Ground, M. & Matthee, M. (2003). Getting a clearer picture: a business application of visual data mining. Conference on Data Mining including Building Applications for CRM & Competitive Intelligence, December 2003, Rio de Janeiro, Brazil.

Text Mining

Driel, M.A. van, Bruggeman, J., Vriend, G., Brunner, H.G., & Leunissen, J.A.M. (2006). A text-mining analysis of the human phenome. European Journal of Human Genetics, 14(5), 535-42.

Fan, W., Wallace, L., Rich, S., & Zhang, Z. (2006). Tapping the power of text mining, Communications of ACM, 49(9), 76-82, 2006.

Hearst, M. (1999). Untangling text data mining. Proceedings of ACL'99: the 37th Annual Meeting of the Association for Computational Linguistics, University of Maryland, June 20-26.

Kroeze, J. H. (2004). Text mining: Discovering unknown patterns in free text. In  Encyclopedia of Data Warehousing and Mining. In Press.

Ming, N., & Baumer, E. (2011). Using text mining to characterize online discussion facilitation. Journal of Asynchronous Learning Networks, 15(2), 71-108.

Smalheiser, N.R. (2012). Literature-based discovery: Beyond the ABCs. Journal of American Soceity for Information Science and Technology, 63(2), 218-224.

Sood, S.O., Churchill, E.F., & Antin, J. (2012). Automatic identification of personal insults on social news sites. Journal of American Soceity for Information Science and Technology, 63(2), 270-285.

Web (Content & Hyperlink Structure) Mining

Aaron, R. D. and Naylor, E. (2001). Tools for searching the "Deep Web". Competitive Intelligence Magazine, 4(4), 47-49.

Chen, H., Chau, M.l, and Zebg, D. (2002). CI Spider: A tool for competitive intelligence on the web. Decision Support Systems, 34(1), 1-17.

Kleinberg, J. M. (1999). Authoritative sources in a hyperlinked environment. Journal of the ACM , 46(5), 604-632.

Krasnow, J. D. (2000). The competitive intelligence and national security threat from website job listings. (Available on line as of April 18, 2003).

Ku, L.W., & Chen, H.H. (2007). Mining opinions from the web: Beyond relevance retrieval. Journal of American Society for Information Science and Technology, 58(12), 1838-1850.

Lam, W.Yang, C.C., & Menczer, F. (2007). Introduction to the speicial topic section on mining web resources for enhancing information retrieval. Journal of American Society for Information Science and Technology, 58(12), 1791-1792.

Liu, Y., Zhang, M., Cen, R., Ru, L., & Ma, S. (2007). Data cleansing for web information retrieval using query independent features. Journal of American Society for Information Science and Technology, 58(12), 1884-1898.

Perugini, S., & Ramakrishnan, N. (2007). Mining web functional dependencies for flexible information access. Journal of American Society for Information Science and Technology, 58(12), 1805-1819.

Sun, A. & Lim, E.P. (2006). Web-unit based mining of homepage relationships. Journal of American Society for Information Science and Technology, 57(3), 394-407

Van Wel, L. (2004). Ethical issues in web data mining. Ethics and Information Technology, 6(2). doi: 10.1023/B:ETIN.0000047476.05912.3d

Wang, F.L., & Yang, C.C. (2007). Mining web data for Chinese segmentation. Journal of American Society for Information Science and Technology, 58(12), 1820-1837.

Web Use & Transaction Log Analysis

Banks, J. (2000). Are transaction logs useful? A ten year study. Journal of Southern Academic and Special Librarianship, 1(3).

Baeza-Yates, R., Hurtado, C., & Mendoza, M. (2007). Improving search engines by query clustering. Journal of American Society for Information Science and Technology, 58(12), 1793-1804.

Blecic, D.D., Bangalore, N.S., Dorsch, J.L., Henderson, C.L., Keonig, M.H., & Weller, A.C. (1998). Using transaction log analysis to improve OPAC retrieval results. College & Research Libraries, Jan. 1998. 39-50.

Bollen, J., Beot-Arie, O., and Van de Sompel, H. (2005). The bX project: Federating and mining usage logs from linking servers. Project Briefing: Fall 2005 Task Force Meeting. Available online at http://www.cni.org/tfms/2005b.fall/abstracts/PB-bx-bollen.html

Davis, P.M. (2003). Information-seeking behavior of chemists: A transaction log analysis of referral URLs. Journal of the American Society for Information Science and Technology, 55(4), 326-332.

Guruprasad, R., Nikam, K. Rao, M. G., & Mudkavi, V. Y. (2009). Web log analysis of e-journal usage and scholarly communication: A case study of e-journal (full-text) download patterns of NAL scientists and engineers. Information Studies, 15(4), 201-232. Available from http://nal-ir.nal.res.in/9073/1/WEB-LOG-ANALYSIS.pdf

Jansen, B. J., Spink, A, Pederson, J. (2005). The effect of specialized multimedia collections on web searching. Journal of Web Engineering. 3(3/4), 182-199.

Jones, S., Cunningham, S.J., McNab, R. J., & Boddie, S. (2000). A transaction log analysis of a digital library. International Journal on Digital Libraries, 3(2),152-169.

Lee, L.-H, & Chen, H.-H. (2012). Mining search intents for collaborative cyberporn filtering. Journal of the American Society for Information Science and Technology, 63(2), 366-376.

Mahoui, M., & Cunningham, S.J. (2000). A comparative transaction log analysis of two computing collections. Lecture Notes in Computer Science, 1923/2000.

Nicholas, D. (2003). Assessing used content across five digital health information services using transaction log files. Journal of Information Science, 29(6), 499-515.

Shi, X., & Yang, C.C. (2007). Mining related queries from web search engine query logs using an improved association rule mining model. Journal of American Society for Information Science and Technology, 58(12), 1871-7883.

Snyder, C. (2005). Transaction log analyses of eletronic book (e-books) usage. Against the Grain, Februrary, 85-89.

Spink, A., Park, M., Jansen, B. J., & Pedersen, J. (2006). Multitasking during web search sessions. Information Processing & Management, 42(1), 264-275.

Srikant, R., & Yang, Y. (2001, May). Mining web logs to improve website organization. WWW10, May 2-5, 2001, Hong Kong. ACM 1-58113-348-0/01/0005.

Taha, A. (2004). Wired research: Transaction log analysis of e-journal databases to assess the research activities and trends in UAE University. Nord I&D Knowledge and Change, 150-159.

Warren, N. (2002). Website log analysis: Approaches for the library of the National Institute of Environmental Health Sciences. Master thesis. University of North Carolina at Chapel Hill: School of Information and Library Science.  Available from http://ils.unc.edu/MSpapers/2785.pdf

Competitive Intelligence

Antia, K.D. & Hesford, J.W. (2007). A process-oriented view of competitive intelligence and its impact on organizational performance. Journal of Competitive Intelligence and Management, 4(1), 5-33.

Chen, H., Chau, M.l, and Zebg, D. (2002). CI Spider: A tool for competitive intelligence on the web. Decision Support Systems, 34(1), 1-17.

Fleisher, C.S., Wright, S., & Tindale, R. (2007). Bibliography and assessment of key competitive intelligence scholarship: Part 4 (2003-2006). Journal of Competitive Intelligence and Management, 4(1), 34-107.

Herring, J. P. (1998). What is intelligence analysis? Competitive Intelligence Magazine, 1(2),  13-16.

Hughes, S., & Beasley, F. (2007). An examination of the existence and usage of competitive intelligence in professional sports. Journal of Competitive Intelligence and Management, 4(1), 108-126.

Krasnow, J. D. (2000). The competitive intelligence and national security threat from website job listings. (Available on line as of April 18, 2003).

Miree, C.E., York, K.M., & Lombardo, S.V. (2007). Using competitive intelligence processes to create value in the healthcare industry. Journal of Competitive Intelligence and Management, 4(1), 127-146.

Data/Text/Web Mining for Library and Information Services

Banerjee, K. (1998). Is data mining right for your library? Computers in Libraries, 18(10), 28-31.

Battioui, C. (2006). Data mining techniques to analyze a library database. Proceedings of SUGI 31. San Francisco, March 26-29, 2006.

Blecic, D.D., Bangalore, N. S., Dorsch, J. L, Henderson, C. L, Koenig, M. H., & Weller, A. C. (1998a). Using transaction log analysis to improve OPAC retrieval results. College & Research Libraries, 59(1), 39-50.

Chan, C.C., Lee, M-H., & Kwang, Y-C. (2007). Association rules mining for knowledge management: A case study of library services. Proceedings of the 9th WSEAS International Conference on Mathematical Methods and Computational Techniques in Electrical Engineering, Arcachon, October 13-15, 2007. http://www.wseas.us/e-library/conferences/2007franceel/papers/571-242.pdf

Chiang, K. (2010). Data mining, data fusion, and libraries. The 31st Annual IATUL Conference (International Association of Scientific and Technological University Libraries).

Collier, H. (2003). Data mining: Does it have applications within the world of libraries? Series: The Journal for the Series Community, 16(2), 209-210.

Coscia, M., Giannotti, F., & Pensa, R. (2009). Social network analysis as knowledge discovery process: A case study on digital bibliography. 2009 International Conference on Advances in Social Network Analysis and Mining, Athens, Greece, July 20-22. doi: http://doi.ieeecomputersociety.org/10.1109/ASONAM.2009.65

Dhiman, A.K. (2003). Data mining and its use in libraries. CALIBER 2003: Ahmedabad. Available from http://shodhganga.inflibnet.ac.in/dxml/bitstream/handle/1944/222/cali_53.pdf?sequence=1

Guenther, K. (2000). Applying data mining principles to library data collection. Computers in Libraries, 20(4), 60-63.

Kao, S., Chang, H., & Lin, C. (2003). Decision support for the academic library acquisition budget allocation via circulation database mining. Information Processing &Management, 39(1), 133-148.

Lavoie, B., Dempsey, L., & Connaway, L.S. (2006, Jan. 15). Making data work harder. Library Journal e-Newsletters. LibraryJournal.com

Mancini, D. D. (1996). Mining your automated system for systemwide decision making. Library Administration & Management, 10(1), 11-15.

Nicholson, S. (2003). The bibliomining process: Data warehousing and data mining for library decision-making. Information Technology and Libraries, 22 (4), 146-151.

Nicholson, S. (2003). Avoiding the Great Data-Wipe of Ought-Three. American Libraries, 34(9), 36.

Nicholson, S. (2003). Bibliomining for automated collection development in a digital library setting: Using data mining to discover web-based scholarly research works. Journal of the American Society for Information Science and Technology, 54(12), 1081-1090.

Nicholson, S. (2005). A framework for Internet archeology: Discovering use patterns in digital library and Web–based information resources. First Monday, 10(2). Available online at http://www.firstmonday.org/issues/issue10_2/nicholson/index.html

Nicholson, S. (2006). The basis for bibliomining: Frameworks for bringing together usage-based data mining and bibliometrics through data warehousing in digital library services. Information Processing and Management, 42(3), 785-804.

Nicholson, S. (2006, January 15). Proof is in the pattern. Library Journal netConnect, Supplement to Library Journal, Winter 2006. 2-6.

Nicholson, S. (2011). The bibliomining process: Data warehousing and data mining for library decision making. Information Technology & Libraries (ITAL), 22(4).

Nicholson, S. & Stanton, J. (2004). Gaining strategic advantage through bibliomining: Data mining for management decisions in corporate, special, digital, and traditional libraries. In Nemati, H. & Barko, C. (Eds.). Organizational data mining: Leveraging enterprise data resources for optimal performance (pp. 247-262). Hershey, PA: Idea Group Publishing.

Papatheodorou, C., Kapidakis, S. Sfakakis, M., and Vassiliou, A. (2003). Mining user communities in digital libraries, Information Technology and Libraries 22(4). 152-157.

Peters, T. (1996). Using transaction log analysis for library management information. Library Administration & Management, 10(1), 20-25.

Prakash, K.; Chand, P. & Gohel, U. (2004). Application of data mining in library and information services. In 2nd Convention PLANNER-2004, Manipur Uni., Imphal, 4-5 November 2004. Available at http://shodhganga.inflibnet.ac.in/dxml/bitstream/handle/1944/435/04Planner_22.pdf?sequence=1

Shieh, J.C. (2010). The integration system for librarians’ bibliomining. The Electronic Library, 28(5), 709-721.

Wu, C.H. (2003). Data mining applied to material acquisition budget allocation for libraries: Design and development. Expert Systems with Applications, 25(3), 401-411.

Wu, C.H., Lee, T.Z., & Kao, S.C. (2004). Knowledge discovery applied to material acquisition for libraries. Information Processing & Management, 40(4), 709-725.

Yan, F. Zhang, M., Tang, J., Sun, T., Deng, Z., & Xiao, L. (2010). Users’ book-loan behaviors analysis and knowledge dependency mining. Web-Age Information Management: Lecture Notes in Computer Science, 6184, 206-217.

Yu, P. (2011). Data mining in library reader management. Proceedings of International Conference on Network Computing and Information Security (NCIS), (pp.54-57). IEEE Xplore Digital Library. doi: 10.1109/NCIS.2011.109 

BlogsCommunity Profiles   | Databases  | eBookstore  | Maps  | PhD  | Second Life |