Un-Redactor

github.com

28 points by kvthweatt 4 hours ago


8note - 2 hours ago

> Republishing altered documents is illegal

what exactly does this mean? misrepresenting the altered document as unaltered?

i cant imagine it being illegal to do madlibs

brikym - an hour ago

You should really put some usage instructions on the README.

    uv run --with PyMuPDF --with pillow ./unredactor-main/unredact.py
I tried a couple PDFs but get "Failed to open PDF: bad argument type for built-in operation".

Redactle.net has something similar where you can double-click or tap-hold then type a note over the redacted word.

jaredwiener - an hour ago

Free Law Project also has this open source tool to detect bad redactions: https://github.com/freelawproject/x-ray

kvthweatt - an hour ago

The point is you can perform a box dimension attack.

If you have a known input, you can match all outputs.

Example: Document that DOJ took down and reuploaded that redacted Trump's name when it was previously available. They used the same size boxes in each location.

You cannot do this with handwriting, but fonts have known widths.

websiteapi - 2 hours ago

why unredact, rather than just edit the pdf to remove the redaction box and insert whatever you want? presumably you'd want a viewer to see that you modified a redaction, but why?

yellow_lead - an hour ago

With regards to the Epstein files, it seems some files are not redacted well.

For instance, this file says Mona if you remove the top layer https://www.justice.gov/epstein/files/DataSet%208/EFTA000136...

Some others I've seen include 1-3 more letters than are in the redaction.

Waterluvian - 3 hours ago

Are there tools for trying to predict possible fits for redacted data given font, black bar size, and context?

typeofhuman - 2 hours ago

> lets you put your own information over a redaction box.

This doesn't remove redactions, it lets you write over them.

- 4 hours ago
[deleted]