The PatchDB Dataset


To foster large-scale research on vulnerability mitigation and to enable a comparison of different detection approaches, we make our dataset PatchDB from our DSN21 paper publicly available.

PatchDB is a large-scale security patch dataset that contains around 12K security patches and 24K non-security patches from the real world. You can find more details on the dataset in the paper "PatchDB: A Large-Scale Security Patch Dataset".

You can see some typical examples on our website. To download the PatchDB dataset, please carefully read the download policy, disclaim, and agreement.

If you are using PatchDB for work that will result in a publication (thesis, dissertation, paper, article), please use the following citation:

@inproceedings{wang2021PatchDB,
  title={PatchDB: A Large-Scale Security Patch Dataset},
  author={Wang, Xinda, Wang, Shu, Feng, Pengbin, Sun, Kun and Jajodia, Sushil},
  booktitle={2021 51st Annual IEEE/IFIP International Conference on Dependable Systems
and Networks (DSN)}, year={2021}, pages={149-160}, doi={10.1109/DSN48987.2021.00030} }
OR
Xinda Wang, Shu Wang, Pengbin Feng, Kun Sun and Sushil Jajodia, "PatchDB: A Large-Scale 
Security Patch Dataset," 2021 51st Annual IEEE/IFIP International Conference on Dependable
Systems and Networks (DSN 2021), 2021, pp. 149-160, doi: 10.1109/DSN48987.2021.00030.

Until now the following institutions were given access:

  1. Alan Turing Institute, UK
  2. Alibaba Group, China
  3. ARMY-C5ISR, USA
  4. Arnica, USA
  5. Chinese Academy of Sciences, China
  6. Chinese University of Hong Kong, China
  7. Communication University of China, China
  8. Crowdstrike, USA
  9. CTCI, USA
  10. Dalian University of Technology, China
  11. DCI Solutions, USA
  12. Federation University Australia, Australia
  13. Fudan University, China
  14. George Mason University, USA
  15. Guangzhou University, China
  16. Hamburg University of Technology, Germany
  17. Hebei University, China
  18. Huazhong University of Science and technology, China
  19. Hunan Normal University, China
  20. IBM Research, USA
  21. Ibn Tofail University, Morocco
  22. Imperial College London, UK
  23. Institute of Information Engineering, China
  24. Korea University, South Korea
  25. Malatya University, Turkey
  26. Meituan, China
  27. Metaculus, USA
  28. Motilal Nehru National Institute of Technology, India
  29. Nanjing University, China
  30. Nanyang Technological University, Singapore
  31. National Institute of Technology Warangal, India
  32. North Carolina State University, USA
  33. Northeastern University, China
  34. Northern Illinois University, USA
  35. Ohio State University, USA
  36. Penn State University, USA
  37. Purdue University, USA
  38. Queensland University of Technology, Australia
  39. SAP Software Solutions, Germany
  40. Sapienza University of Rome, Italy
  41. Shandong University, China
  42. Sichuan University, China
  43. Simula Research Laboratory, Norway
  44. Southeast University, China
  45. Speed Technology Shenzhen Co.,Ltd., China
  46. SRM Valliammai Engineering College, India
  47. Technische Universität Braunschweig, Germany
  48. Tel Aviv University, Israel
  49. Tencent, China
  50. University of Arizona, USA
  51. University of California, Berkeley, USA
  52. University of California, Riverside, USA
  53. University of Chinese Academy of Sciences, China
  54. University of Electronic Science and Technology of China, China
  55. University of Luxembourg, Luxembourg
  56. University of North Texas, USA
  57. University of Missouri-Kansas City, USA
  58. University of Texas at San Antonio, USA
  59. University of Virginia, USA
  60. Vanderbilt University, USA
  61. Washington State University, USA
  62. Worcester Polytechnic Institute, USA
  63. Wuhan University, China
  64. Xi'an Jiaotong University, China
  65. Xi'an University of Posts and Telecommunications, China
  66. Xidian University, China
  67. Zhejiang University, China

Team


The PatchDB dataset is built by Sun Security Laboratory (SunLab) at George Mason University, Fairfax, VA.

sunlab       csis


The PatchDB Dataset | Sun Security Laboratory at George Mason University