Furry (the best of) E621 via TBIB almost SFW sampled with metadata :: Nyaa ISS

Furry (the best of) E621 via TBIB almost SFW sampled with metadata

Category:
Date:
2022-10-01 12:21 UTC
Submitter:
Seeders:
5
Information:
No information.
Leechers:
0
File size:
162.1 GiB
Completed:
45
Info hash:
12300d111410f19f3ede0ffb9bf2adaf27dbda7e
This is FURRY-CENTRIC site rip originated from **e621:net** imageboard grabbed mostly via **tbib:org** crossposting: - **TBIB** for interval **04.2016-07.2022** post ID 5.000.000 - 11.000.000 - **E621** only topmost **up to 07.2016** post ID 10.000 - 999.999 **This rips is not intended to be "complete and maximum quality" but rather "representative the best of" to help anybody to open the furry world while not bumping into yiff (furry hentai, often male/male) and comix stockpiles** Another reason is neural network training over art images. There are [promising results](https://github.com/aperveyev/booru_yolo) for specie-specific head classes (dragonhead, ponyhead, Judy Hopps, Nick Wilde, ...), stay tuned. Manually: - comic and 4koma, most of line-arts, segmented scans and overtexted covers filtered out - crops done when large simple or dirty background, occationally gamma correction and other nontrivial improvements made Also a lot of ~~handjob~~ manual filtering done to avoid obviously unsafe art and throttle most of furry fetishes. Despite furry is not SFW by definition, (almost) no frontal nudity and evident adult activity left here so R14+ seems applicable. ### This release contains: - **279.736 JPG images** * renamed to contain **ID - up_to_3_copyrights ~ up_to_5_species_or_characters (up_to_2_artists)** * PNG >> JPG (94% quality) [converted](https://imagemagick.org), some of them "sampled" to reasonable size / volume * deduplicated using [AntiDupl](https://github.com/ermig1979/AntiDupl) up to 4% similarity * splitted / zipped into folders by ID range and also **Q**uestionable and e**X**tra separated (use [MaxView](https://www.faststone.org/FSMaxViewDetail.htm) or unzip to browse) . - additional TSV (tab separated text) metadata * key parameters for every image (from imageboard and released) [spreadsheet](https://www.libreoffice.org/download/download-libreoffice) capable * tag-to-image relations - 8.167.078 rows; involve some [tool](https://gnuwin32.sourceforge.net/packages/gawk.htm) to use . ### More about sampling 1) [detected](https://exiftool.org) image properties ``` exiftool -filecreatedate -imagesize -filesize# -filetype -JPEGQualityEstimate -csv -r B:\TBIB\ > exif.txt ``` 2) [sophisticatedly](https://www.oracle.com/cis/database/technologies/xe-downloads.html) used ``` select 'magick convert "'||sourcefile||'" '|| case when iw/ih between 0.8 and 1.2 and px>4000000 then '-resize 1920x1920^>' when iw/ih<0.8 and px>5000000 then '-resize 2480x2480^>' when iw/ih>1.2 and px>6000000 then '-resize 2560x2560^>' else to_char(null) end||' '|| case when jq>=98 then '-quality 94' else to_char(null) end||' '|| case when filesize/(iw*ih)>0.7 then '-blur 4' else to_char(null) end|| ' "'||replace(sourcefile,'\tbib\','\tbic\')||'"' mm from exif e where ( jq between 98 and 100 or (iw/ih between 0.8 and 1.2 and px>4000000) or (iw/ih<0.8 and px>5000000) or (iw/ih>1.2 and px>6000000) ) and ((filesize>1600000 and jq>84) or filesize>4000000 or (filesize/(iw*ih)>0.7) ) order by fpath desc, fname ``` 3) image left untouched when minimal or negative effect of sampling . ### More about metadata . #### TBIB_E621_2022.tsv . FID - imageboard post ID (e621 when < 1000000, tbib when >= 5000000) **for torrent content** FPATH - folder / zip name FNAME - file name TORR_FSIZE - file size, bytes TORR_ISIZE - image size WxH TORR_JQ - JPEG quality TORR_MD5 - checksum **imageboard originated** if available ORIG_DT - posting date ORIG_RATE - Safe / Questionable ORIG_ISIZE - WxH ORIG_EXT - image type (extension) ORIG_MD5 - checksum **imagemagick:org** calculated TENTR - enthropy (complexity) TSKEW - skewness (black/white balance) TSTDDEV - (black/white contrast) TCOLORS - count of colors **keras-craft text detector** calculated TXSIZE - total text area TXCNT - number of text pieces . #### TBIB_E621_2022_TAGS.tsv . FID - imageboard post ID TAG - string tag TAG_CAT - tag category COPYRIGHT / CHARACTER / SPECIE / ARTIST / GENERAL or UNKNOWN

File list

  • TBIB_E621_2022
    • 01eeeeee.q.zip (906.1 MiB)
    • 01eeeeee.zip (577.4 MiB)
    • 050xxxxx.zip (579.9 MiB)
    • 051xxxxx.zip (1018.6 MiB)
    • 052xxxxx.zip (1.3 GiB)
    • 053xxxxx.zip (726.6 MiB)
    • 054xxxxx.zip (1.6 GiB)
    • 055xxxxx.zip (1.5 GiB)
    • 056xxxxx.zip (1.9 GiB)
    • 057xxxxx.zip (1.3 GiB)
    • 058xxxxx.zip (1.3 GiB)
    • 059xxxxx.zip (1.6 GiB)
    • 060xxxxx.q.zip (557.1 MiB)
    • 060xxxxx.zip (1.6 GiB)
    • 061xxxxx.q.zip (539.2 MiB)
    • 061xxxxx.zip (1.4 GiB)
    • 062xxxxx.q.zip (558.0 MiB)
    • 062xxxxx.zip (1.6 GiB)
    • 063xxxxx.q.zip (523.5 MiB)
    • 063xxxxx.zip (1.6 GiB)
    • 064xxxxx.q.zip (621.0 MiB)
    • 064xxxxx.zip (1.6 GiB)
    • 065xxxxx.q.zip (665.7 MiB)
    • 065xxxxx.zip (1.8 GiB)
    • 066xxxxx.q.zip (718.0 MiB)
    • 066xxxxx.zip (1.8 GiB)
    • 067xxxxx.q.zip (747.4 MiB)
    • 067xxxxx.zip (1.8 GiB)
    • 068xxxxx.q.zip (690.6 MiB)
    • 068xxxxx.zip (1.5 GiB)
    • 069xxxxx.q.zip (649.1 MiB)
    • 069xxxxx.zip (1.6 GiB)
    • 070xxxxx.q.zip (710.4 MiB)
    • 070xxxxx.zip (1.6 GiB)
    • 071xxxxx.q.zip (657.0 MiB)
    • 071xxxxx.zip (1.6 GiB)
    • 072xxxxx.q.zip (648.3 MiB)
    • 072xxxxx.zip (1.6 GiB)
    • 073xxxxx.q.zip (722.4 MiB)
    • 073xxxxx.zip (1.6 GiB)
    • 074xxxxx.q.zip (688.1 MiB)
    • 074xxxxx.zip (1.5 GiB)
    • 075xxxxx.q.zip (735.5 MiB)
    • 075xxxxx.zip (1.5 GiB)
    • 076xxxxx.q.zip (971.0 MiB)
    • 076xxxxx.zip (2.2 GiB)
    • 077xxxxx.q.zip (1.1 GiB)
    • 077xxxxx.zip (2.3 GiB)
    • 078xxxxx.q.zip (1012.9 MiB)
    • 078xxxxx.zip (2.1 GiB)
    • 079xxxxx.q.zip (870.0 MiB)
    • 079xxxxx.zip (2.6 GiB)
    • 080xxxxx.q.zip (739.7 MiB)
    • 080xxxxx.zip (2.3 GiB)
    • 081xxxxx.q.zip (747.0 MiB)
    • 081xxxxx.zip (2.0 GiB)
    • 082xxxxx.q.zip (733.5 MiB)
    • 082xxxxx.zip (2.0 GiB)
    • 083xxxxx.q.zip (755.2 MiB)
    • 083xxxxx.zip (2.2 GiB)
    • 084xxxxx.q.zip (669.9 MiB)
    • 084xxxxx.zip (1.9 GiB)
    • 085xxxxx.q.zip (660.5 MiB)
    • 085xxxxx.zip (1.8 GiB)
    • 086xxxxx.q.zip (793.9 MiB)
    • 086xxxxx.zip (1.8 GiB)
    • 087xxxxx.q.zip (678.2 MiB)
    • 087xxxxx.zip (1.7 GiB)
    • 088xxxxx.q.zip (697.3 MiB)
    • 088xxxxx.zip (1.8 GiB)
    • 089xxxxx.q.zip (702.7 MiB)
    • 089xxxxx.zip (2.1 GiB)
    • 090xxxxx.q.zip (1.3 GiB)
    • 090xxxxx.zip (2.4 GiB)
    • 091xxxxx.q.zip (1.1 GiB)
    • 091xxxxx.zip (1.9 GiB)
    • 092xxxxx.q.zip (1.2 GiB)
    • 092xxxxx.zip (2.0 GiB)
    • 093xxxxx.q.zip (1.5 GiB)
    • 093xxxxx.zip (2.5 GiB)
    • 094xxxxx.q.zip (1.4 GiB)
    • 094xxxxx.zip (2.2 GiB)
    • 095xxxxx.q.zip (618.2 MiB)
    • 095xxxxx.zip (1.1 GiB)
    • 096xxxxx.q.zip (438.7 MiB)
    • 096xxxxx.zip (766.9 MiB)
    • 098xxxxx.q.zip (2.1 GiB)
    • 098xxxxx.zip (3.2 GiB)
    • 100xxxxx.Q.zip (2.0 GiB)
    • 100xxxxx.X.zip (1.3 GiB)
    • 100xxxxx.zip (5.4 GiB)
    • 101xxxxx.Q.zip (1.4 GiB)
    • 101xxxxx.X.zip (761.0 MiB)
    • 101xxxxx.zip (3.1 GiB)
    • 102xxxxx.Q.zip (1.1 GiB)
    • 102xxxxx.X.zip (592.2 MiB)
    • 102xxxxx.zip (2.2 GiB)
    • 103xxxxx.Q.zip (1.2 GiB)
    • 103xxxxx.X.zip (627.9 MiB)
    • 103xxxxx.zip (2.2 GiB)
    • 104xxxxx.Q.zip (1.2 GiB)
    • 104xxxxx.X.zip (517.6 MiB)
    • 104xxxxx.zip (2.3 GiB)
    • 105xxxxx.Q.zip (1.1 GiB)
    • 105xxxxx.X.zip (559.5 MiB)
    • 105xxxxx.zip (2.2 GiB)
    • 106xxxxx.Q.zip (1.2 GiB)
    • 106xxxxx.X.zip (607.5 MiB)
    • 106xxxxx.zip (2.3 GiB)
    • 107xxxxx.Q.zip (1.1 GiB)
    • 107xxxxx.X.zip (558.6 MiB)
    • 107xxxxx.zip (2.2 GiB)
    • 108xxxxx.Q.zip (1.3 GiB)
    • 108xxxxx.X.zip (629.9 MiB)
    • 108xxxxx.zip (2.3 GiB)
    • 109xxxxx.Q.zip (1.3 GiB)
    • 109xxxxx.X.zip (634.2 MiB)
    • 109xxxxx.zip (2.6 GiB)
    • TBIB_E621_2022.tsv (72.9 MiB)
    • TBIB_E621_2022_TAGS.tsv (210.3 MiB)