Name: Jianhua Feng

Title: Professor

Phone: 62789150

Email: fengjh@tsinghua.edu.cn

Homepage: http://dbgroup.cs.tsinghua.edu.cn/

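Education

B.Eng. (Computer Science and Technology), Tsinghua University, China, 1991;

M.Eng. (Computer Science and Technology), Tsinghua University, China, 1993;

Ph.D. in Engineering (Computer Science and Technology), Tsinghua University, China, 2006.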

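Professional Appointments

Department of Computer Science and Technology, Tsinghua University: Deputy Chair (2007-);

Teaching Steering Subcommittee for Computer Science and Technology Programs in Higher Education, Ministry of Education: Member of the Expert Working Group (2009-2010).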

Research Areas

Database management systems; data security and privacy protection; information retrieval

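Research Overview

I have long been engaged in database-related research and development. I have obtained a series of results in native XML database systems, keyword search over heterogeneous data, data security and privacy protection, and new types of database systems, and have published high-quality academic papers on these topics.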

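In the area of native XML database systems, I proposed encoding methods for XML data, semantic caching methods, structural and twig join algorithms, and the "hidden conditioned homomorphism" algorithm for deciding whether two XML document trees are equivalent. These results effectively address the storage, query evaluation, and homomorphism checking of XML data, and together form an overall framework for query processing and query optimization in native XML database systems. In the area of keyword search over heterogeneous data, I proposed a unified graph model for heterogeneous data and built indexing and ranking mechanisms for keyword search over such data. This work effectively addresses the integration and retrieval of heterogeneous data and has produced a keyword search framework that supports autocompletion, fuzzy search, and type-ahead (search-as-you-type) querying. In addition, in collaboration with researchers abroad, I have studied in depth the key query optimization techniques that a column-oriented database must address, namely late materialization, block iteration, column-specific compression, and invisible joins, and I am currently developing HUABASE, a large-scale column-oriented database product.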

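My research has broadened the scope of database research and brought database, data mining, and information retrieval techniques together. It has promoted the crossover and integration of these disciplines and has attracted wide attention from the academic community. My main conference papers have been cited more than 180 times in total, of which more than 130 are citations by other authors. In particular, two papers published at ACM SIGMOD and ACM CIKM, top international conferences in the database field, were cited more than 80 times within just two years by scholars from over 10 countries and regions, including Europe, the United States, Australia, Singapore, Japan, and China. More than 40 of these citations appeared in the top international database conferences ACM SIGMOD, VLDB, ICDE, and EDBT and in top international journals such as ACM TODS. The citing scholars come mainly from well-known universities and research institutions, including MIT, UIUC, Duke, Wisconsin, Arizona State University, The Ohio State University, MPI, and NUS.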

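Research Projects

Industry collaboration project: Research and product development of a large-scale column-oriented database system (2010-2012);

National Natural Science Foundation of China project: Storage techniques and performance evaluation for flash-based databases (2009-2011);

863 Program project: Keyword search methods and techniques for massive heterogeneous data (2007-2010);

973 Program sub-project: Information representation and knowledge discovery for visual media (2006-2011);

National Natural Science Foundation of China project: Key issues in native XML database management systems (2006-2008).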

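Awards and Honors

China Computer Federation: Inaugural YOCSEF Young Scientist Award (2010);

Beijing Education and Teaching Achievement Award, First Prize: A training system for high-level, innovative doctoral students in computer science (2009);

HP Labs Innovation Research Award: Efficient schema-aware keyword search over heterogeneous data (2008).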

Publications

[1] Guoliang Li, Jianhua Feng, Xiaofang Zhou, Jianyong Wang. Providing Built-in Keyword Search Capabilities in RDBMS. Accepted by The VLDB Journal, 2010.

[2] Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou. KEMB: A Keyword-Based XML Message Broker. Accepted by IEEE Transactions on Knowledge and Data Engineering (IEEE TKDE), 2010.

[3] Guoliang Li, Shengyue Ji, Chen Li, Jiannan Wang, Jianhua Feng: Efficient fuzzy type-ahead search in TASTIER. Proc. 26th International Conference on Data Engineering (ICDE 2010), Long Beach, California, USA, IEEE 2010, pp. 1105-1108.

[4] Jianhua Feng, Guoliang Li, Jianyong Wang, Lizhu Zhou: Finding and ranking compact connected trees for effective keyword proximity search in XML documents. Information Systems, vol. 35, no. 2, pp. 186-203, 2010.

[5] Guoliang Li, Jianhua Feng, Jianyong Wang: Structure-aware indexing for keyword search in databases. Proc. 18th ACM Conference on Information and Knowledge Management (CIKM 2009), Hong Kong, China, ACM 2009, pp. 1453-1456.

[6] Guoliang Li, Xiaofang Zhou, Jianhua Feng, Jianyong Wang: Progressive Keyword Search in Relational Databases. Proc. 25th International Conference on Data Engineering (ICDE 2009), Shanghai, China, IEEE 2009, pp. 1183-1186.

[7] Yang Ye, Yu Zheng, Yukun Chen, Jianhua Feng, Xing Xie: Mining Individual Life Pattern Based on Location History. Proc. 10th International Conference on Mobile Data Management (MDM 2009), Taipei, Taiwan, IEEE 2009, pp. 1-10.

[8] Guoliang Li, Shengyue Ji, Chen Li, Jianhua Feng: Efficient type-ahead search on relational data: a TASTIER approach. Proc. ACM SIGMOD International Conference on Management of Data (SIGMOD 2009), Providence, Rhode Island, USA, ACM 2009, pp. 695-706.

[9] Shengyue Ji, Guoliang Li, Chen Li, Jianhua Feng: Efficient interactive fuzzy keyword search. Proc. 18th International Conference on World Wide Web (WWW 2009), Madrid, Spain, ACM 2009, pp. 371-380.

[10] Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou: Incremental sequence-based frequent query pattern mining from XML queries. Data Mining and Knowledge Discovery (DMKD), vol. 18, no. 3, pp. 472-516, 2009.

[11] Guoliang Li, Chen Li, Jianhua Feng, Lizhu Zhou: SAIL: Structure-aware indexing for effective and progressive top-k keyword search over XML documents. Information Sciences, vol. 179, no. 21, pp. 3745-3762, 2009.

[12] Zhiping Zeng, Anthony K. H. Tung, Jianyong Wang, Jianhua Feng, Lizhu Zhou: Comparing Stars: On Approximating Graph Edit Distance. Proceedings of the VLDB Endowment (PVLDB), vol. 2, no. 1, pp. 25-36, 2009.

[13] Guoliang Li, Jianhua Feng, Lizhu Zhou: Retune: Retrieving and Materializing Tuple Units for Effective Keyword Search over Relational Databases. Proc. 27th International Conference on Conceptual Modeling (ER 2008), Barcelona, Spain, 2008, Lecture Notes in Computer Science, vol. 5231, pp. 469-483.

[14] Guoliang Li, Beng Chin Ooi, Jianhua Feng, Jianyong Wang, Lizhu Zhou: EASE: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. Proc. ACM SIGMOD International Conference on Management of Data (SIGMOD 2008), Vancouver, BC, Canada, ACM 2008, pp. 903-914.

[15] Guoliang Li, Xuhui Liu, Jianhua Feng, Lizhu Zhou: Efficient Similarity Search for Tree-Structured Data. Proc. 20th International Conference on Scientific and Statistical Database Management (SSDBM 2008), Hong Kong, China, 2008, Lecture Notes in Computer Science, vol. 5069, pp. 131-149.

[16] Jianhua Feng, Guoliang Li, Na Ta: A Semantic Cache Framework for Secure XML Queries. Journal of Computer Science and Technology (JCST), vol. 23, no. 6, pp. 988-997, 2008.

[17] Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou: Effective keyword search for valuable LCAs over XML documents. Proc. 16th ACM Conference on Information and Knowledge Management (CIKM 2007), Lisbon, Portugal, ACM 2007, pp. 31-40.

[18] Charu C. Aggarwal, Na Ta, Jianyong Wang, Jianhua Feng, Mohammed Javeed Zaki: XProj: a framework for projected structural clustering of XML documents. Proc. 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD 2007), San Jose, California, USA, ACM 2007, pp. 46-55.

[19] Yi Wang, Shi-Xia Liu, Jianhua Feng, Lizhu Zhou: Mining Naturally Smooth Evolution of Clusters from Dynamic Data. Proc. 7th SIAM International Conference on Data Mining (SDM 2007), Minneapolis, Minnesota, USA, SIAM 2007.

[20] Jianhua Feng, Qian Qian, Jianyong Wang, Li-Zhu Zhou: Efficient Mining of Frequent Closed XML Query Pattern. Journal of Computer Science and Technology (JCST), vol. 22, no. 5, pp. 725-735, 2007.

[21] Jianhua Feng, Yuguo Liao, Yong Zhang: HCH for Checking Containment of XPath Fragment. Journal of Computer Science and Technology (JCST), vol. 22, no. 5, pp. 736-748, 2007.

[22] Yi Wang, Lizhu Zhou, Jianhua Feng, Jianyong Wang, Zhi-Qiang Liu: Mining Complex Time-Series Data by Learning Markovian Models. Proc. 6th IEEE International Conference on Data Mining (ICDM 2006), Hong Kong, China, IEEE 2006, pp. 1136-1140.

[23] Guoliang Li, Jianhua Feng, Jianyong Wang, Yong Zhang, Lizhu Zhou: Incremental Mining of Frequent Query Patterns from XML Queries for Caching. Proc. 6th IEEE International Conference on Data Mining (ICDM 2006), Hong Kong, China, IEEE 2006, pp. 350-361.