Bulk Data Shortcomings

patft is the uspto's granted patent searching screen. It slices, dices and lets you make simple or complex queries. What I would like would be for the uspto to provide all the underlying bulk data available to the publlic. They have made a staggering amount data available, but naturally in doing so they have left me wanting more! I dream of winning a contest where I get access to the uspto's database for a little while- like those contests where someone gets to run through a department store grabbing what they can in the time allowed.

uspto bulk data
Available?Data
Yes11976-date patent data
YesUSPC data for patents issued before May 2016 and all plant patents
YesCPC data for utility patents
NoCPC data for plant and reissued patents
NoIPC data for patents
NoInventor names2 for patents 1920 thru 1975
Nodocument numbers3
1 There are 305 patents not in the xml files out of literally millions of available patents.
2try a in/edison and isd/192$ in 1790 to present [entire database]
3aptft (the uspto's application searching screen) has the document_number associated with each patent application but the bulk granted patent xml files do not contain this field. This makes it hard to determine if a referenced application ever was issued.