The PatchDB Dataset
To foster large-scale research on vulnerability mitigation and to enable comparison of different detection approaches, we are making PatchDB, the dataset from our DSN 2021 paper, publicly available.
PatchDB is a large-scale security patch dataset that contains around 12K security patches and 24K non-security patches from the real world. You can find more details on the dataset in the paper "PatchDB: A Large-Scale Security Patch Dataset".
You can see some typical examples on our website. To download the PatchDB dataset, please carefully read the download policy, disclaimer, and agreement.
If you are using PatchDB for work that will result in a publication (thesis, dissertation, paper, article), please use the following citation:
@inproceedings{wang2021PatchDB,
  title={PatchDB: A Large-Scale Security Patch Dataset},
  author={Wang, Xinda and Wang, Shu and Feng, Pengbin and Sun, Kun and Jajodia, Sushil},
  booktitle={2021 51st Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN)},
  year={2021},
  pages={149-160},
  doi={10.1109/DSN48987.2021.00030}
}
Xinda Wang, Shu Wang, Pengbin Feng, Kun Sun and Sushil Jajodia, "PatchDB: A Large-Scale
Security Patch Dataset," 2021 51st Annual IEEE/IFIP International Conference on Dependable
Systems and Networks (DSN 2021), 2021, pp. 149-160, doi: 10.1109/DSN48987.2021.00030.
To date, the following institutions have been granted access:
- Alan Turing Institute, UK
- Alibaba Group, China
- ARMY-C5ISR, USA
- Arnica, USA
- Brooklyn College, USA
- Carnegie Mellon University, USA
- Chinese Academy of Sciences, China
- Chinese University of Hong Kong, China
- Communication University of China, China
- Crowdstrike, USA
- CTCI, USA
- Dalian University of Technology, China
- DCI Solutions, USA
- Federation University Australia, Australia
- Fudan University, China
- George Mason University, USA
- Guangzhou University, China
- Hamburg University of Technology, Germany
- Hebei University, China
- Huazhong University of Science and Technology, China
- Hunan Normal University, China
- IBM Research, USA
- Ibn Tofail University, Morocco
- Imperial College London, UK
- Institute of Information Engineering, China
- Korea University, South Korea
- Luxembourg Institute of Science and Technology, Luxembourg
- Malatya University, Turkey
- Meituan, China
- Metaculus, USA
- Mobfish AI, USA
- Motilal Nehru National Institute of Technology, India
- Nanjing University, China
- Nanyang Technological University, Singapore
- National Institute of Information and Communications Technology, Japan
- National Institute of Technology Warangal, India
- North Carolina State University, USA
- Northeastern University, China
- Northwestern University, USA
- Northern Illinois University, USA
- NousResearch, USA
- Ohio State University, USA
- Patched, Singapore
- Penn State University, USA
- Purdue University, USA
- Queensland University of Technology, Australia
- SAP Software Solutions, Germany
- Sapienza University of Rome, Italy
- Shandong University, China
- Sichuan University, China
- Simula Research Laboratory, Norway
- SODIUM-24, LLC, USA
- Southeast University, China
- Speed Technology Shenzhen Co., Ltd., China
- SRM Valliammai Engineering College, India
- Technische Universität Braunschweig, Germany
- Technische Universität Dortmund, Germany
- Tel Aviv University, Israel
- Tencent, China
- Texas A&M University, USA
- Thomson Reuters, USA
- University of Arizona, USA
- University of California, Berkeley, USA
- University of California, Riverside, USA
- University of California, Santa Barbara, USA
- University of Chinese Academy of Sciences, China
- University of Electronic Science and Technology of China, China
- University of Luxembourg, Luxembourg
- University of Maryland, USA
- University of Missouri-Kansas City, USA
- University of North Texas, USA
- University of Texas at San Antonio, USA
- University of Virginia, USA
- Vanderbilt University, USA
- Washington State University, USA
- Worcester Polytechnic Institute, USA
- Wuhan University, China
- Xi'an Jiaotong University, China
- Xi'an University of Posts and Telecommunications, China
- Xidian University, China
- Zhejiang University, China
Team
The PatchDB dataset is built by Sun Security Laboratory (SunLab) at George Mason University, Fairfax, VA.