Patent Application Bibliographic Data (https://eipweb.uspto.gov/SOMS/start.swe?SWECmd=Start&SWEHo=eipweb.uspto.gov) contains the bibliographic text (i.e., the front page) of each non-provisional utility and plant patent application published weekly (on Thursdays) from calendar year 2001 to the present. Images and drawings are excluded.
Patent Application Data (text and images) is available for a fee.
Bibliographic Data is available in XML format in accordance with various versions of the Patent Application Document Type Definition (DTD). Starting with version 4.0, International Common Element (ICE) compatibility was established.
Bibliographic data is consolidated weekly and compressed into zip files (one zip file per week), each of which must be downloaded separately. The zip files are grouped by year into separate product directories, as shown in the following table.
Bibliographic Data Product Directory
03/15/2001 – 12/2001
01/2002 – 12/2002
01/2003 – 12/2003
01/2004 – 12/2004
01/2005 – 12/2005
01/2006 – 12/2006
01/2007 – 12/2007
01/2008 – 12/2008
01/2009 – 12/2009
01/2010 – 12/2010
The name used to identify each zip file takes the following form:
Where the date of publication is represented by "yyyymmdd" and "nn" is a two-digit, fixed-length number (with leading zero) representing the sequentially-numbered week of the year.
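As a minimal sketch of the "nn" component, the Python function below formats a two-digit, zero-padded week number for a publication date. The text does not define how weeks are counted, so the rule used here (consecutive seven-day blocks starting on January 1) is an assumption, and the function name week_label is hypothetical.

```python
from datetime import date


def week_label(publication_date: date) -> str:
    """Return a two-digit, zero-padded week number for a publication date.

    Assumption (not stated in the source): weeks are counted as
    consecutive seven-day blocks starting on January 1.
    """
    day_of_year = publication_date.timetuple().tm_yday
    week = (day_of_year - 1) // 7 + 1
    return f"{week:02d}"


print(week_label(date(2001, 3, 15)))  # prints 11
```

If the published file names turn out to follow a different numbering rule, only the arithmetic inside week_label would need to change; the zero-padding stays the same.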
Each zip file contains three (3) files for the weekly publication:
ipabyyyymmdd.xml (bibliographic information in XML)
ipabyyyymmddlst.txt (list of published patent application numbers in ascending order)
ipabyyyymmddrpt.txt (statistical/summary report)
Note: Prior to version 4.0, the names of the zip file and of the files contained within it used the prefix "pab" instead of "ipab", taking the form pabyyyymmdd.
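The naming rules above can be sketched in Python as follows. This is an illustrative helper, not a USPTO tool: the function names weekly_file_names and missing_members, and the pre_v4 flag, are hypothetical. The sketch also assumes that the pre-4.0 files use the same "lst.txt" and "rpt.txt" suffixes with the shorter prefix. Which publication dates fall before version 4.0 is not stated here, so the caller must supply that flag.

```python
import zipfile
from datetime import date


def weekly_file_names(publication_date: date, pre_v4: bool = False) -> list[str]:
    """Build the three file names expected inside one weekly zip file."""
    # Prefix is "ipab" from version 4.0 onward and "pab" before it.
    prefix = "pab" if pre_v4 else "ipab"
    stem = f"{prefix}{publication_date:%Y%m%d}"
    return [
        f"{stem}.xml",     # bibliographic information in XML
        f"{stem}lst.txt",  # list of published application numbers
        f"{stem}rpt.txt",  # statistical/summary report
    ]


def missing_members(archive, publication_date: date, pre_v4: bool = False) -> list[str]:
    """Return the expected file names that are absent from a downloaded zip.

    `archive` may be a path or an open binary file object.
    """
    with zipfile.ZipFile(archive) as zf:
        present = set(zf.namelist())
    expected = weekly_file_names(publication_date, pre_v4)
    return [name for name in expected if name not in present]
```

Calling weekly_file_names(date(2010, 1, 7)) yields ipab20100107.xml, ipab20100107lst.txt, and ipab20100107rpt.txt. An empty list from missing_members indicates that a downloaded archive contains all three expected files.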