Skip to content

TesseraCT partial tile cleanup #644

@pimvanpelt

Description

@pimvanpelt

Description

I archived IPng's halloumi2026h2 logshard with 4765491 leaves, which was consuming 146GB on disk.
Using Geomys' photocamera app, turned the log into a 4.0GB zipfile. Unzipping that, this yields 4.4GB of data on disk. The resulting unzipped data passes ct-fsck:

pim@summer:/ssd-vol0/logs$ du -sch tile/ issuer/
4.4G    tile/
946K    issuer/
4.4G    total
pim@summer:/ssd-vol0/logs$ ~pim/bin/ct-fsck -origin halloumi2026h2.log.ct.ipng.ch         -monitoring_url file:///ssd-vol0/logs/         -public_key="MFkwEwYHKoZIzj0CAQYIKoZIzj0DAQcDQgAENDhVrTtcQebbk/R6YW8Gt1RW+WZ8qscHUsI2/TW5f7bu+mEst2lskXbR9HEBQvyhkYKVTuigQjgZvDYqRhcn0Q=="         -N 20 -user_agent_info="ct-ops@ipng.ch"
I1205 00:12:37.997709 2546513 main.go:335] Using verifier string: halloumi2026h2.log.ct.ipng.ch+53348351+BTBZMBMGByqGSM49AgEGCCqGSM49AwEHA0IABDQ4Va07XEHm25P0emFvBrdUVvlmfKrHB1LCNv01uX+27vphLLdpbJF20fRxAUL8oZGClU7ooEI4Gbw2KkYXJ9E=
I1205 00:12:37.997827 2546513 main.go:112] Checking issuers CAS
I1205 00:12:37.998584 2546513 fsck.go:56] Fsck: checking log of size 4765491
I1205 00:12:54.667362 2546513 fsck.go:111] Successfully fsck'd log with size 4765491 and root ucxfVJxhzGDvsFzvHEmV/U+p3cvjWSHuwLhy+ljp17Q= (b9cc5f549c61cc60efb05cef1c4995fd4fa9ddcbe35921eec0b872fa58e9d7b4)
I1205 00:12:54.667397 2546513 main.go:82] OK
pim@summer:/ssd-vol0/logs$ cat checkpoint 
halloumi2026h2.log.ct.ipng.ch
4765491
ucxfVJxhzGDvsFzvHEmV/U+p3cvjWSHuwLhy+ljp17Q=

— halloumi2026h2.log.ct.ipng.ch UzSDUQAAAZrfajPWBAMARjBEAiBPRUhy0ldb3tJPq0liXcP1EQ+c+pseGzRFad6gNfBjhwIgai3IamIX6KlBeslhmnOp2YgY79jP0GiTqbOCCmx3d2U=

Looking at the source POSIX filesystem, I notice the majority of the files are partial tiles:

root@ctlog1:/ssd-vol0/logs/halloumi2026h2# find . -type f -ls | awk '{ total += $7} END { print total }'
270780557061

root@ctlog1:/ssd-vol0/logs/halloumi2026h2# find . -type f -ls | awk '$11 ~ /.*p\/.*/ { total += $7} END { print total }'
255180358712

And looking at the active IPng logs, comparing Gouda (Sunlight) and Halloumi (TesseraCT):

ssd-vol0/logs/gouda2025h2    569G  5.79T   569G  /ssd-vol0/logs/gouda2025h2
ssd-vol0/logs/gouda2026h1    465G  5.79T   465G  /ssd-vol0/logs/gouda2026h1

ssd-vol0/logs/halloumi2025h2   1.63T  2.85T  1.63T  /ssd-vol0/logs/halloumi2025h2
ssd-vol0/logs/halloumi2026h1   1.59T  2.85T  1.59T  /ssd-vol0/logs/halloumi2026h1

We will need to implement something like partial-aftersun for TesseraCT. If not, IPng's Halloumi logs will run out of usable ZFS space within ~14 weeks.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    Status

    No status

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions