open-notebooklm / README.md
gabriel chua
update README
9149e8d
|
raw
history blame
1.85 kB

Open PDF to Podcast

Overview

This project provides a tool to convert any PDF document into a podcast episode! Leveraging open-source LLMs and text-to-speech models, this tool processes the content of a PDF, generates a natural dialogue suitable for an audio podcast, and outputs it as an MP3 file.

Features

  • Convert PDF to Podcast: Upload a PDF and convert its content into a podcast dialogue.
  • Engaging Dialogue: The generated dialogue is designed to be informative and entertaining.
  • User-friendly Interface: Simple interface using Gradio for easy interaction.

Installation

To set up the project, follow these steps:

  1. Clone the repository:

    git clone https://github.com/gabrielchua/open-pdf2podcast.git
    cd pdf-to-podcast
    
  2. Create a virtual environment and activate it:

    python -m venv .venv
    source .venv/bin/activate
    
  3. Install the required packages:

    pip install -r requirements.txt
    

Usage

  1. Set up API Key(s): For this project, I am using LLama 3.1 405B hosted on Fireworks API as its JSON Mode supports passing a pydantic object. So, please set the API key as the FIREWORKS_API_KEY environment variable

  2. Run the application:

    python main.py
    

    This will launch a Gradio interface in your web browser.

  3. Upload a PDF: Upload the PDF document you want to convert into a podcast.

  4. Generate Audio: Click the button to start the conversion process. The output will be an MP3 file containing the podcast dialogue.

Acknowledgements

This project is forked from knowsuchagency/pdf-to-podcast

License

This project is licensed under the Apache 2.0 License. See the LICENSE file for more information.