Department of Computer Science
Jiaheng Lu's homepage
Jiaheng Lu


Group leader of UDBMS

Department of Computer Science

University of Helsinki

Email :

Office : Exactum C211


Short biography

Research goal: Improving the performance and usability of database systems

I am a computer scientist and a teacher, with a research interest in databases and data management. My recent topics include multi-model database management systems, semantic string processing, and job optimization for big data platforms.

I was awarded a Ph.D. degree in 2007 from the National University of Singapore. My PhD topic was XML query processing. I did two years of postdoctoral research at the University of California, Irvine. I then joined the Renmin University of China in 2008, where I worked for seven years. I am now working at the University of Helsinki, Finland. I have broad research and teaching experience in four countries (China, Singapore, USA, and Finland).


One of my books on Hadoop was named one of the Top 10 Bestselling IT Books in China.



  1. Congratulations: both our Master program of Data Science and our Master program of Computer Science were among the top 3 most popular programs for applicants in the whole university in 2020. [Details] (21.1.2021).
  2. We will give a new tutorial at DASFAA 2021! "Multi-Model Data Query Languages and Processing Paradigms" [Details] (21.1.2021).
  3. We will give a new tutorial at ICDE 2021! "Workload-Aware Performance Tuning for Autonomous DBMSs" [Details] (6.12.2020).
  4. Two new PhD students Zhengtong Yan and Shuxun Zhang joined our UDBMS group in October. Welcome Zhengtong and Shuxun! (9.10.2020).
  5. Our Data Science M.Sc. program is among the global top 10 according to Forbes. Congratulations to all of the program's teachers and students! [Link] (4.10.2020).
  6. I was promoted to a permanent professorship in Computer Science (Data Management). Thank you all for your help with my academic career! (4.10.2020).
  7. We will give a new tutorial at CIKM 2020! "Multi-Model Data Query Languages and Processing Paradigms" [Details] (23.5.2020).
  8. Our survey paper "A Survey on Automatic Parameter Tuning for Big Data Processing Systems" has been published in ACM Computing Surveys (CSUR) [Open access], [Related VLDB tutorial] (2.5.2020).
  9. We published a new journal paper on benchmarking multi-model databases: "Holistic evaluation in multi-model databases benchmarking." [PDF] [Code] (6.3.2020).
  10. More news ...

Research Topics

Selected papers:

  1. Jiaheng Lu, Irena Holubova: Multi-model Databases: A New Journey to Handle the Variety of Data. ACM Computing Surveys 2019 [PDF]
  2. Jiaheng Lu, Irena Holubova, Bogdan Cautis: Multi-model Databases and Tightly Integrated Polystores. CIKM 2018 Tutorial [PDF]
  3. Jiaheng Lu: Towards Benchmarking Multi-Model Databases (Abstract). CIDR 2017 [PDF]
  4. Jiaheng Lu, Irena Holubova: Multi-model Data Management: What's New and What's Next? EDBT 2017 Tutorial [PDF][slides]
  5. Chao Zhang, Jiaheng Lu, Pengfei Xu, Yuxing Chen: UniBench: A Benchmark for Multi-model Database Management Systems. TPCTC 2018: 7-23 [PDF]

Selected papers:

  1. Pengfei Xu, Jiaheng Lu: Towards a Unified Framework for String Similarity Joins. PVLDB 12(12) 2019: [PDF], [Slides], [Source Codes]
  2. Pengfei Xu, Jiaheng Lu: Top-k String Auto-Completion with Synonyms. DASFAA (2) 2017: 202-218 [Slides, Source Codes]
  3. Jiaheng Lu, Chunbin Lin, Wei Wang, Chen Li, Xiaokui Xiao: Boosting the Quality of Approximate String Matching by Synonyms. ACM Trans. Database Syst. 40(3): 15 (2015)
  4. Jiaheng Lu, Chunbin Lin, Wei Wang, Chen Li, Haiyong Wang: String similarity measures and joins with synonyms. SIGMOD Conference 2013: 373-384
More research topics and papers ...

Code and dataset release


PhD students

  1. Shuxun Zhang (2020-)
  2. Zhengtong Yan (2020-)
  3. Gongsheng Yuan (2017-)
  4. Yuxing Chen (2017-)
  5. Pengfei Xu (2016-2020) Thesis title: Efficient Approximate String Matching with Synonyms and Taxonomies
  6. Chao Zhang (2015-)
  7. Yu Liu (Renmin University of China) (2014-2018) (Co-supervised with Prof. Zhewei Wei)
  8. Juwei Shi (Renmin University of China) (2013-2018)
  9. Zhaoan Dong (Renmin University of China) (2013-2018) (Co-supervised with Prof. Xiaofang Zhou and Prof. Ju Fan)

    Detailed information ...

Academic service

Workshop co-chair:

  1. Workshop co-chair in ER 2018.
  2. Keyword search and data exploratory workshop 2016 with ICDE 2016
  3. Keyword search on structured data (KEYS) workshop with SIGMOD 2012
  4. XML-DM Workshop with WAIM 2010
  5. Cloud-DB workshop with CIKM 2010

Proceedings chair:

  1. IEEE ICDE Conference 2013

Program Committee:

  1. ACM SIGMOD 2010, 2013, 2014, 2015, 2016 Research track
  2. Very Large Data Bases Conference proceedings (PVLDB) 2010, 2015, 2017, 2020, 2021
  3. IEEE ICDE Conference 2011, 2017, 2019, 2020
  4. ER Conference 2018, 2019
  5. Database Systems for Advanced Applications Conference DASFAA 2010, 2012, 2013, 2014, 2020, 2021
  6. Asia-Pacific Web Conference APWeb 2008, 2009, 2011, 2013, 2014, 2015
  7. Web-Age Information Management Conference WAIM 2014, 2015, 2016
  8. WAIM-APWEB Conference 2017
  9. Web Information Systems Engineering (WISE) Conference 2009
  10. Chinese Conference on Information Retrieval (CCIR) 2015, 2016
  11. Australasian Database Conference ADC 2013, 2017, 2018, 2019