Child pages
  • dblp News Home
Skip to end of metadata
Go to start of metadata

Dieser confluence Space wird nur dazu genutzt, um News bezüglich dblp in Form von Blogeinträge zu verfassen. Diese Blogeinträge werden täglich exportiert und auf der Startseite eingebunden.

ORCID state of dblp

Since more than a year now, dblp has intensified its efforts to link dblp bibliographies to ORCIDs used by that author. ORCID information are now added regularly to the dblp data set. The primary sources for ORCIDs are: first, the annual ORCID open data dump and, second, metadata directly provided by publishers who have started to increasingly label author signatures with ORCID information. Neither of those data sources are free of errors and data hick-ups, so we are still manually cleaning the ORCID data prior to adding them to the corpus. But overall, ORCIDs have helped us to correct numerous cases of homonymous and synonymous bibliographies in dblp, so it is absolutely worth our time.

Since the past big ORCID update from last week (as of July 11, 2018), you can now find in dblp:

  • 806,744 ORCID author signatures (i.e., author-publication pairs), that is 6.5% of all author signatures in the dataset
  • 165,752 bibliographies that are linked to ORCIDs (either manually or implicit)
  • 24,357 bibliographies with ORCIDs manually verified by the dblp team

If you look at computer science publications in dblp that have been published in the recent years, you will find that the fraction of author signatures with ORCID in dblp has climbed to now about 10.9% off all signatures in 2018. Of course, year 2018 is not done yet, and we are continuously working on improving the coverage further among all publications, regardless of the year of publication. The number of signatures and coverage for publication in recent years is: 

  • 2014: 892,240 signatures (8.2% coverage)
  • 2015: 930,746 signatures (7.7% coverage)
  • 2016: 984,559 signatures (6.7% coverage)
  • 2017: 1,042,103 signatures (5.6% coverage)
  • 2018: 439,350 signatures (10.9% coverage)

The oldest publication with an ORCID is from 1961. Because of the small number of publications in dblp from that time, the ORCID coverage for 1961 is a remarkable 0.12%. 

A historical version of dblp (called hdblp) is now available at zenodo. hdblp contains historical revisions of all metadata records in dblp. The file can be used to reconstruct dblp for each day between June 1999 and March 2018.

dblp via HTTPS

Finally, all dblp servers now support HTTPS. We will gradually move towards exclusively using HTTPS in the future, so please feel free to update your bookmarks.

4 million publications!

We are happy to celebrate more than 4,000,000 million computer science publications in the dblp computer science bibliography!


ORCID ( is a widely used persistent identifier scheme for researchers. As of September 2017, there are more than 3.8 million registered ORCIDs of which about 800,000 have at least one publication listed in the public ORCID corpus. Many publishers will now ask their authors to provide an ORCID when submitting or publishing a paper. We encourage all researchers to register and map their ORCID with their publications.

A few weeks ago, we started first experiments with integrating ORCIDs into dblp. At the moment, there are about 700,000 signatures (i.e., author-publication pairs) in dblp for which we know an ORCID. This is about 6% of all signatures. We expected ORCIDs to help us identify authors and create clear, unambiguous author bibliographies (see How does dblp handle homonyms and synonyms). We were not disappointed. For the initial import, we found in our corpus

  • 600 cases where an author profile was related to more than one ORCID. This indicates that the profile actually lists publications from different authors.
  • 5000 cases where the same ORCID appears in more than one author profile.

We are currently processing these cases. While we found several cases where ORCID information is wrong (e.g., authors accidentally claiming publications that where written by someone else), the data is very reliable in general. However, according to our philosophy, manual confirmation is needed. This will take some time.

A detailed description on ORCIDs on the dblp web interface and in the data dump can be found at in our F.A.Q.s

We will keep you posted on development.  

With the latest release of the dblp XML data dump, we made some extensions to the document type definition (DTD) of the XML format. The goal of these changes is to lay the groundwork for a number of planned extensions of the dblp data model that will be realized at a later date.

These changes will only affect you if you use:

Regular users of the web site or other export formats – like the BibTeX export – are not affected.

As always, the latest version of the DTD can be found at:

All upcoming as well as earlier versions of the DTD file will be preserved along with the persistent snapshot releases at:

A brief summary of the recent changes is available at:

Detailed Information on the newly added attributes can be found in the F.A.Q.:

The dblp computer science bibliography now lists more than 5000 conference and workshop series, as well as more than 1500 journals in computer science.

With the latest release of the dblp XML data dump, we made some extensions to the document type definition (DTD) of the XML format. The goal of these changes is to lay the groundwork for a number of planned extensions of the dblp data model that will be realized at a later date.

These changes will only affect you if you use:

Regular users of the web site or other export formats – like the BibTeX export – are not affected.

We encourage you to update any local copy of the dblp.dtd file on your system and adjust data import code accordingly. The new DTD should be compatible with all earlier dblp XML dumps. First changes to the actual dblp data records that make use of these changes are expected to be made not before mid of October.

As always, the latest version of the DTD can be found at:

All upcoming as well as earlier versions of the DTD file will be preserved along with the persistent snapshot releases at:

A brief summary of the recent changes is available at:

Thanks to the search API provided by our friends at the Allen Institute for Artificial Intelligence (AI2), we have added a new full-text search option to dblp. By choosing "Semantic Scholar search" as your default search action in the drop down menu next to the search bar, searching will now retrieve documents from dblp based on matches within the full texts as indexed by the Semantic Scholar service. Since dblp usually does not have access to the full texts of research articles, this is something that has not been possible before. Please try it for yourself. 

The new search option is currently an experimental feature (which may still break once in a while) and we will be happy to hear your thoughts and comments about it.

The research group of Ralf Schenkel (chair of Database and Information Systems at the University of Trier) which is (co-)hosting the dblp computer science bibliography seeks to hire a research assistant (TV-L 13). For more information, please see the official job posting (in German only).

In the past weeks, we imported bibliographic metadata for more than 9.000 PhD theses from the French archive Hyper Articles en Ligne (HAL) into dblp. These theses cover a significant part of the French computer science community, with some theses reaching back as far as the 1950s. The HAL archive is the second source of PhD metadata to be continuously imported into dblp in addition to the more than 10.000 data records already added from the Deutsche Nationalbibliothek (DNB), the German National Library. In a next step, we plan to include data from the EThOS Repository of the British Library.

We understand the PhD thesis as a very important and central publication in the professional life of a computer scientist which should have its place in a computer science bibliography. Hence, we aim to further increase the coverage of PhD theses in dblp. However, open bibliographic metadata of PhD theses is often surprisingly difficult to obtain. For many countries we are still unaware of any central repository collecting and aggregating information on national PhD theses. In some countries, bibliographic metadata of PhD theses even seems to be hidden behind a paywall.

If you are aware of an open metadata repositories for CS theses in your country or international level, we will be grateful for your advise. Please feel free to contact us.

For almost ten years now, the CompleteSearch interface has provided fast and convenient access to all of the metadata collected by dblp. Features like faceted search and fast search as you type made it a valuable asset to the computer science community. Since 2011, the service has been available under the easily memorizable domain With today's update of the dblp web system, the final stage of the integration of the CompleteSearch capabilities and its convenient search interface has concluded, and CompleteSearch is now an integral part of the dblp web system. At the same time, the domain will now point directly to the dblp main site. This domain will play a more prominent role in the future URL and ID schemes of dblp.

The CompleteSearch engine and search interface has been developed and maintained by Hannah Bast (formerly at the Max Planck Institute for Informatics in Saarbrücken, now at the University of Freiburg, Germany). Although operated independently from the main dblp web site, the development of the search interface has always been in consultation with the dblp team. The layout of both web sites had been unified, and a preview of CompleteSearch's facets had been part of the classic dblp author pages. The interface provided a lot of search features that the rather basic search of the dblp site in Trier had been lacking (like an extremely fast and responsive search interface, searching by facets, search as you type, Boolean operators for complex queries, and more), and quickly became the community's first choice when searching publications within dblp.

With the overhaul of the dblp web system (which started about two years ago), the features of the old CompleteSearch interface have been integrated step by step into the dblp web systems hosted in Trier and Dagstuhl. This has been made possible by local back-end instances of the CompleteSearch engine that are now powering all of the search, filter, and browsing capabilities of the dblp web pages. With today's update, all major features of the interface have been integrated, as well as the former URL.

This does – of course – not mean that the search interface won't see further improvements and updates in the future. After all, dblp is a project that is continuously adapting and evolving. The improvement of the search engine's front end and back end will remain a collaboration between the dblp team in Trier and Hannah Bast's research group in Freiburg.

At the same time, today's update also means that the "modern-style" dblp web page layout is now established as standard for all aspects of the dblp web site, and that the old "classic-style" layout which is currently still available at some of the dblp pages will fade out with future updates.

For dblp, it is of utmost importance that every update improves the user experience and the utility of dblp for you (the computer scientists) in your daily work. Although any major update runs the risk of breaking existing features or disrupting established work habits, please be assured that this is never made intentionally. Hence, if you feel that an important feature of the old interface is missing, or if you encounter any problems when using the new search pages, please contact us under



Today, dblp reached the wonderful "Schnapszahl" of 3,333,333 publications.

With the next upcoming XML dump, you might notice that we added some additional attributes to a (very small) number of dblp records. These changes have become necessary to realize extensions of the dblp data model in our back-end. Most of our users won't notice any difference at all, but if you are working on our XML raw data dump, you may need to update any local copy of the dblp.dtd file in order to validate future versions of the dblp.xml file. This is also true for all future  stable releases of the dblp data set. The new DTD is compatible with all earlier XML dumps.

We are happy to announce that on Thursday, June 18, 2015, the dblp computer science bibliography indexed its 3-millionth publication. Thanks to the joint effort of the University of Trier and Schloss Dagstuhl, as well as the commitment and help of our dedicated users and data partners, dblp has grown to be the world's most comprehensive open bibliographic data service in computer science. And the service is still growing strong: In each of the past three years, more than 325.000 new publications have been added to the database.

Every month, more than 450.000 distinct users visit the dblp web pages. On average, five dblp pages are viewed per second, and every three seconds a new user session is started. The dblp team understands this ever-growing demand as an incentive to keep consolidating and improving our service.

dblp was built for the computer science research community. It is your tool, and we want it to meet your specific needs, e.g., when you are searching for articles, looking for bibliographic data, or browsing authors and publication venues of computer science. So if you have any ideas or suggestions about how we can improve the utility of dblp for your daily work, please let us know!


  • No labels