syncabook - cheat

s# syncabook

syncabook is a set of tools for creating ebooks with synchronized text and audio (a.k.a. read along, read aloud; like Amazon’s Whispersync). You start with a list of text files (plaintext or XTML) and a list of audio files (.wav or .mp3) and get an ebook in the open EPUB3 with Media Overlays format.
The synchronization is done automatically using the afaligner library. It is a forced aligner that works by synthesizing text and then aligning synthesized and recorded audio using a variation of the DTW (Dynamic Time Warping) algorithm. [r4victor/syncabook]

syncabook是一套用于创建带有同步文本和音频的电子书的工具（也称为“跟读”、“朗读”；类似于亚马逊的Whispersync）。你可以从一系列文本文件（纯文本或XTML）和一系列音频文件（.wav或.mp3）开始，最终得到一个采用开放EPUB3带媒体叠加格式的电子书。
同步是通过使用afaligner库自动完成的。它是一个强制对齐器，通过合成文本并使用DTW（动态时间规整）算法的一种变体对合成音频和录制音频进行对齐。 [r4victor/syncabook]

install

git clone --depth=1 https://github.com/r4victor/syncabook
cd syncabook

# Or
mkdir syncabook
cd syncabook
git clone --depth=1 https://github.com/scillidan/syncabook
git clone --depth=1 https://github.com/scillidan/afaligner
vim Dockerfile

FROM python:3.9-slim

RUN apt update -q \
	&& apt install --no-install-recommends -yq espeak \
	libespeak-dev \
	ffmpeg \
	&& apt install -yq gcc

RUN pip install --upgrade pip
RUN pip install numpy==1.23.4
RUN pip install pytest==7.1.3

WORKDIR /syncabook

COPY afaligner ./afaligner
COPY syncabook ./syncabook

RUN pip install -e afaligner
RUN pip install -r syncabook/requirements/base.txt
RUN pip install -e syncabook

WORKDIR /
ENTRYPOINT ["syncabook"]

sudo docker build -t <user>/syncabook -f Dockerfile .

sudo docker run --rm -it <user>/syncabook:latest -h

Windows 10

Get aeneas-win64-setup-*.exe from Releases - sillsdev/aeneas-installer.
set "AENEAS_FORCE_CEW=True"

usage

cd books
mkdir alice_in_wonderland
cd alice_in_wonderland
mkdir plaintext

For example, download “Alice in Wonderland, Retold in Words of One Syllable” by Carroll and Gorham, the Plain Text UTF-8 format from Project Gutenberg, rename to text.txt.
Download the corresponding book’s vocal reading audio files from LibriVox, rename to audio.zip, then decompress to audio/.
In plaintext/, create a 001_title.txt.
Open text.txt, cut all content that above chapter strings liked CHAPTER I, ACT I into 001_title.txt.

# syncabook download_files <url> <book>
syncabook split_text --mode opening --p <book_index> <book>\text.txt <book>\plaintext
# syncabook split_text --mode delimeter --p <book_index> <book>\text.txt <book>\plaintext
# syncabook split_text --mode equal --n 2 <book>\text.txt <book>\plaintext
syncabook to_xhtml <book>/plaintext <book>/sync_text
syncabook sync <book>
syncabook create <book>

For example:

sudo docker run --rm -v ~/Git/_/synclibrivox/books:/books -it scillidan/syncabook:latest split_text --mode opening --p CHAPTER books/alice_in_wonderland/text.txt books/alice_in_wonderland/plaintext
sudo docker run --rm -v ~/Git/_/synclibrivox/books:/books -it scillidan/syncabook:latest to_xhtml books/alice_in_wonderland/plaintext books/alice_in_wonderland/sync_text
sudo docker run --rm -v ~/Git/_/synclibrivox/books:/books -it scillidan/syncabook:latest sync books/alice_in_wonderland
sudo docker run --rm -v ~/Git/_/synclibrivox/books:/books -it scillidan/syncabook:latest create books/alice_in_wonderland

reference

annex

syncabook.mp4 (bilibili)

Keyboard shortcuts

cheat

install

Windows 10

usage

reference

annex