The PatchDB Dataset


To foster large-scale research on vulnerability mitigation and to enable a comparison of different detection approaches, we make our dataset PatchDB from our DSN21 paper publicly available.

PatchDB is a large-scale security patch dataset that contains around 12K security patches and 24K non-security patches from the real world. You can find more details on the dataset in the paper "PatchDB: A Large-Scale Security Patch Dataset".

You can see some typical examples on our website. To download the PatchDB dataset, please carefully read the download policy, disclaim, and agreement.

If you are using PatchDB for work that will result in a publication (thesis, dissertation, paper, article), please use the following citation:

@inproceedings{wang2021PatchDB,
  title={PatchDB: A Large-Scale Security Patch Dataset},
  author={Wang, Xinda, Wang, Shu, Feng, Pengbin, Sun, Kun and Jajodia, Sushil},
  booktitle={2021 51st Annual IEEE/IFIP International Conference on Dependable Systems
and Networks (DSN)}, year={2021}, pages={149-160}, doi={10.1109/DSN48987.2021.00030} }
OR
Xinda Wang, Shu Wang, Pengbin Feng, Kun Sun and Sushil Jajodia, "PatchDB: A Large-Scale 
Security Patch Dataset," 2021 51st Annual IEEE/IFIP International Conference on Dependable
Systems and Networks (DSN 2021), 2021, pp. 149-160, doi: 10.1109/DSN48987.2021.00030.

Until now the following institutions were given access:

  1. Alan Turing Institute, UK
  2. Alibaba Group, China
  3. ARMY-C5ISR, USA
  4. Arnica, USA
  5. Brooklyn College, USA
  6. Carnegie Mellon University, USA
  7. Chinese Academy of Sciences, China
  8. Chinese University of Hong Kong, China
  9. Communication University of China, China
  10. Crowdstrike, USA
  11. CTCI, USA
  12. Dalian University of Technology, China
  13. DCI Solutions, USA
  14. Federation University Australia, Australia
  15. Fudan University, China
  16. George Mason University, USA
  17. Guangzhou University, China
  18. Hamburg University of Technology, Germany
  19. Hebei University, China
  20. Huazhong University of Science and technology, China
  21. Hunan Normal University, China
  22. IBM Research, USA
  23. Ibn Tofail University, Morocco
  24. Imperial College London, UK
  25. Institute of Information Engineering, China
  26. Korea University, South Korea
  27. Luxembourg Institute of Science and Technology, Luxembourg
  28. Malatya University, Turkey
  29. Meituan, China
  30. Metaculus, USA
  31. Mobfish AI, USA
  32. Motilal Nehru National Institute of Technology, India
  33. Nanjing University, China
  34. Nanyang Technological University, Singapore
  35. National Institute of Information and Communications Technology, Japan
  36. National Institute of Technology Warangal, India
  37. North Carolina State University, USA
  38. Northeastern University, China
  39. Northwestern University, USA
  40. Northern Illinois University, USA
  41. NousResearch, USA
  42. Ohio State University, USA
  43. Patched, Singapore
  44. Penn State University, USA
  45. Purdue University, USA
  46. Queensland University of Technology, Australia
  47. SAP Software Solutions, Germany
  48. Sapienza University of Rome, Italy
  49. Shandong University, China
  50. Sichuan University, China
  51. Simula Research Laboratory, Norway
  52. SODIUM-24, LLC, USA
  53. Southeast University, China
  54. Speed Technology Shenzhen Co.,Ltd., China
  55. SRM Valliammai Engineering College, India
  56. Technische Universit├Ąt Braunschweig, Germany
  57. Technische Universit├Ąt Dortmund, Germany
  58. Tel Aviv University, Israel
  59. Tencent, China
  60. Texas A&M University, USA
  61. Thomson Reuters, USA
  62. University of Arizona, USA
  63. University of California, Berkeley, USA
  64. University of California, Riverside, USA
  65. University of California, Santa Barbara, USA
  66. University of Chinese Academy of Sciences, China
  67. University of Electronic Science and Technology of China, China
  68. University of Luxembourg, Luxembourg
  69. University of Maryland, USA
  70. University of Missouri-Kansas City, USA
  71. University of North Texas, USA
  72. University of Texas at San Antonio, USA
  73. University of Virginia, USA
  74. Vanderbilt University, USA
  75. Washington State University, USA
  76. Worcester Polytechnic Institute, USA
  77. Wuhan University, China
  78. Xi'an Jiaotong University, China
  79. Xi'an University of Posts and Telecommunications, China
  80. Xidian University, China
  81. Zhejiang University, China

Team


The PatchDB dataset is built by Sun Security Laboratory (SunLab) at George Mason University, Fairfax, VA.

sunlab       csis


The PatchDB Dataset | Sun Security Laboratory at George Mason University