Yet another article To Write


What we're gonna do:

  1. Create a dataset of images by scraping an epub using pyautogui
  2. Convert those images into text using tesseract OCR
  3. Traing an RNN on this text and generate novel text that should resemble the initial one, using the already avaible word-rnn repo