Efficiently navigating long-form content like podcasts, webinars, and tutorials can be challenging without proper structure. Generating chapters solves this problem by dividing content into logical sections, enabling better accessibility and usability. Sieve's Transcript Analysis Function simplifies this process with two powerful features: Chapter Generation and Custom Chapters.
In this post, we’ll explore how these modes work, why they are valuable, and provide a quick guide to implementing them.
Generating chapters for content provides several benefits for both creators and consumers:
By automating this process with Sieve’s Transcript Analysis Function, you can save time and effort while delivering high-quality results.
Sieve's Transcript Analysis function offers two versatile modes for chapter creation:
This mode uses LLM-powered analysis to automatically segment your video or audio into logical chapters based on the transcript.
Ideal for:
This mode provides greater control by letting you manually define chapter titles or use an LLM to generate them.
Best for:
Available Modes:
Users can choose from various transcription backends like "groq-whisper" (default), "stable-ts," and "whisper-timestamped," as well as LLM backends including "gpt-4o-2024-08-06" (default), "gpt-4o-2024-05-13," and "gpt-4o-mini."
Here’s a code snippet for automatically generating chapters from a video or audio file:
import sieve
file = sieve.File(url="https://storage.googleapis.com/sieve-prod-us-central1-public-file-upload-bucket/c4d968f5-f25a-412b-9102-5b6ab6dafcb4/52e169ed-9b30-4241-a956-591f2c6c30ab-chatgpt_tutorial_charlieChang.mp4")
transcription_backend = "groq-whisper"
llm_backend = "gpt-4o-2024-08-06"
transcript_analysis = sieve.function.get("sieve/transcript-analysis")
output = transcript_analysis.run(file,
transcription_backend,llm_backend, prompt="",
generate_summary=False, generate_title=False,
generate_tags=False, generate_chapters=True)
output_list = list(output)
for chapter in output_list[2]['chapters']:
print(f'{chapter['title']}:{chapter['timecode']} \n')
Ellen and Brian | JENNIE 'Mantra' Dance Tutorial
File Duration- 11:34
Time taken to generate chapters- 13s
Chapters | Time-stamp |
---|---|
Introduction | 00:00 |
Dance Tutorial Begins | 00:18 |
Hand Movements | 00:34 |
Adding Feet Movement | 01:39 |
Combining Movements | 02:05 |
Second Eight Count | 03:06 |
Third Eight Count | 05:39 |
Fourth Eight Count | 08:01 |
Conclusion | 11:11 |
Comment: The "Conclusion" chapter is not an appropriate title and is not time-synced correctly. All other generated chapters are excellent.
Charlie Chang | ChatGPT Tutorial: How to Use Chat GPT For Beginners 2024
File Duration- 27:05
Time taken to generate chapters- 24s
Chapters | Time stamp |
---|---|
Introduction | 00:00 |
Account Setup | 00:59 |
Limitations of ChatGPT | 01:53 |
Basic Use of Prompts | 02:32 |
Command Type 1: Facts | 02:34 |
Command Type 2: Suggestions | 03:51 |
Ad Break | 04:58 |
Command Type 3: Language Translation | 05:26 |
Command Type 4: Creative Writing | 09:33 |
Command Type 5: Text Summarization | 14:23 |
Command Type 6: List Generation | 15:43 |
Command Type 7: Pros and Cons | 16:47 |
Command Type 8: Studies and Quotes | 17:29 |
Command Type 9: Feedback and Improvement | 18:40 |
Command Type 10: Role Playing | 19:55 |
Intro to ChatGPT & Demonstrations | 21:37 |
Command Prompts for ChatGPT | 22:53 |
Tips for Effective Use | 25:54 |
Conclusion and Outro | 27:03 |
Comment: Excellent!
Lex Fridman Podcast | Aravind Srinivas: Perplexity CEO on Future of AI, Search & the Internet
File Duration- 3:02:15
Time taken to generate chapters- 2m 10s
Chapters | Time-stamp |
---|---|
Exploring the Realms of Conversational AI: From Introduction and Concepts to Technical Insights and Functionality | 00:00:00 |
Challenges and Strategies in AI, User Experience, and Product Design | 00:04:12 |
Business Models & Market Dynamics | 00:09:13 |
The Future of AI and Search Engines | 00:13:13 |
User Experience Introduction and Considerations of Ad Breaks | 00:29:36 |
Challenges in AI Output and Larry Page's Influence | 00:34:03 |
Building a Product and Adapting to User Needs | 00:38:22 |
Design Decisions | 00:43:03 |
Influential Entrepreneurs: The Success of Bezos and Elon | 00:46:52 |
The Impact of Open Source in AI and The Recipe for ChatGPT | 00:56:13 |
Enhancing Post-Training Reasoning in Small Models with RLHF | 01:11:07 |
Chain of Thought Reasoning | 01:16:06 |
Future of AI Reasoning | 01:20:14 |
The Intersection of Human Curiosity and AI: An Introduction to AI and Compute | 01:24:20 |
AGI and Inference Limits | 01:32:23 |
Ad Break | 01:36:38 |
Perplexity and LLM Products | 01:41:21 |
Challenges of Web Search Transformation | 01:52:32 |
Exploring Truth and Prospects in AI Models and RAG | 01:56:25 |
Improving Indexing and Snippets | 02:01:15 |
Startup Challenges and Advice | 02:21:25 |
Inspiration, Hard Work, and the Path to Education and Exploration | 02:28:26 |
Ad Break, Search, and Knowledge Discovery: Navigating the Intersection of Marketing and Information Exploration | 02:32:00 |
Perplexity and Curiosity | 02:36:19 |
AI and Human Connection | 02:50:43 |
Navigating the Intersection of AI and Human Values: Challenges, Biases, and Hope | 02:55:42 |
Comment: Excellent! Employed
min_chapter_length=210s
parameter to control the number of chapters generated.
Sieve's chapter generation capabilities streamline content organization while enhancing accessibility and engagement. Start using the Transcript Analysis Function today to transform how your audience interacts with your content.
For support or questions, join our Discord community or contact us at contact@sievedata.com.