OCR Trademark Searches
Search 1,011,906 ocr'ed registration certificates.
From registration number 1 issued 1870-10-25 to 3,752,366 issued 2010-02-23.
It's predominantly registration certificates I don't think are in TESS and additional
ones described below.
My service provider wanted to double my fees for having nearly three gigabytes of ocr'ed data.
To avoid that I removed most of the data. What remains here are the 17,192 trademarks that aren't otherwise online, they aren't in tess or tsdr.
The original data set is
(another site of mine).
The searches are case insensitive (ex: a search for Marshmallow will give the same
results as searches for marshmallow or marshMallow).
the favorable words box could contain special punctuation characters as explained
Also note that words of three or fewer characters are not indexed. There is also
of stopwords that don't get indexed. Searching for words of three characters or less or for stopwords
will not match anything in the database. Additionally, words appearing in 50% or more of
the ocr'ed registration certificates are not indexed. Words like Registered or phrases like 'Patent Office' would
not be searchable. These are limitations of MySQL, the underlying free database used here.
I added ten thousand registration
certificates that aren't on uspto.gov at all (not in tess/tsdr).
I've also ocr'ed the nearly 4,000 tiff that are empty files (file size of zero bytes)
on the usamark dvds. All but two were in tsdr.
I've ocr'ed the ~600,000 registrations I do not believe are in TESS and I've continued
to ocr more of the registrations that are most likely in TESS.