Skip to content

A Python application that extracts, filters, and concatenates text entries from LiveCaption Translator's SQLite database, removing duplicates and substrings while preserving the original sequence.

Notifications You must be signed in to change notification settings

eric15342335/livecaption-translator-export-db

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

export_db

Use with the translation_history.db SQLite database file generated by https://github.com/SakiRinn/LiveCaptions-Translator application.

Technical summary:

WITH FilteredTexts AS (
  -- First, select distinct source texts with their minimum ID
  SELECT DISTINCT SourceText, MIN(Id) as MinId
  FROM TranslationHistory
  GROUP BY SourceText
),
NonContainedTexts AS (
  -- Then filter out texts that are contained within others
  SELECT FT1.SourceText, FT1.MinId
  FROM FilteredTexts FT1
  WHERE NOT EXISTS (
    SELECT 1 
    FROM FilteredTexts FT2
    WHERE FT2.SourceText != FT1.SourceText
    AND FT2.SourceText LIKE '%' || FT1.SourceText || '%'
  )
)

-- Finally, concatenate the filtered texts in ID order
select group_concat(SourceText) from (SELECT SourceText
FROM NonContainedTexts
ORDER BY MinId ASC) as a

Python3 + tkinter + sqlite3 for user-friendly UI.

About

A Python application that extracts, filters, and concatenates text entries from LiveCaption Translator's SQLite database, removing duplicates and substrings while preserving the original sequence.

Topics

Resources

Stars

Watchers

Forks

Languages