The Art of Erasure: Cleaning Manga Pages in Scanlation

Once a raw manga page has been acquired, pre-processed with denoising, and had its borders tidied up, the next critical step in the scanlation pipeline is cleaning. This phase involves meticulously preparing the page to receive new, translated text, ensuring a seamless and professional final product. At its core, cleaning is about removing elements that would otherwise interfere with the translation process.

What is "Cleaning" in Scanlation?

In the context of scanlation, "cleaning" primarily refers to the process of **removing the original Japanese text** from the manga pages. This includes:

However, it's crucial to distinguish between various types of text. For our workflow, we have a specific approach regarding sound effects:

The Tool: PanelCleaner for Automated Text Removal

Manually erasing every piece of text on every page is an incredibly tedious and time-consuming endeavor. This is where automated tools become indispensable. One highly effective tool for this task is **PanelCleaner**.

PanelCleaner, an open-source project available on GitHub (https://github.com/VoxelCubes/PanelCleaner/tree/master), is a powerful Python-based solution designed to detect and remove text from manga pages. It leverages image processing techniques to identify areas containing text, creating a mask that can then be used to clear those regions.

Our Specific PanelCleaner Workflow:

It's important to note that while PanelCleaner includes built-in functionalities for "inpainting" (filling in the erased areas with generated background pixels), **we specifically do not use PanelCleaner's built-in inpainting feature**. Why?

By using PanelCleaner for text detection and masking, and then applying a separate inpainting solution, we achieve a higher quality outcome, minimizing visual distractions and maintaining the integrity of the original artwork.

Panel Cleaner

There are two ways to use PanelCleaner: through its graphical user interface (GUI) or via the command line. While both methods function identically, we'll focus on the command-line approach because it allows for programmatic control.

Before vs. After: Witnessing the Clean Canvas

The transformation a page undergoes after cleaning is dramatic. It changes from a visually "busy" page with Japanese text to a blank canvas ready for the redrawing.

Cleaning

Conclusion

Cleaning is a foundational step in scanlation, stripping away the linguistic barriers to prepare a manga page for its new life in another language. By strategically using tools like PanelCleaner for text removal, while carefully choosing to retain original SFX and deferring inpainting to a later, more specialized stage, scanlators ensure that the visual integrity of the manga is preserved while paving the way for accurate and aesthetically pleasing translations. This meticulous attention to detail is what transforms raw scans into high-quality digital releases.