Ultimate Data Extraction Prompt
I've refined this prompt to extract data from YouTube videos, so I don't need to watch an 80-minute video when I only care about the key points and unique insights. (BUT if I have the time, I do prefer to watch or listen to the full video.)
From what I have tested, this prompt works best with OpenAI's o1. I had a similar version back when I was using GPT-3.5, and I evolved the prompt over time.
*Note: copy-paste your transcript after the prompt, at the very end. It doesn't have to be a transcript; any kind of text works.
**Note: you can also do cool stuff, like using NotebookLM to recreate the text as a podcast-style version. So if you initially had a video that was maybe 80 min long, you could end up with 20 min of audio focused on the most crucial points of the entire initial 80 min conversation. It's a pretty cool toy.
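If you'd rather not paste by hand every time, here is a minimal Python sketch of the same step. It assumes you have the prompt and the transcript as plain strings; the function name `build_input` and the fallback behaviour when the marker is missing are my own choices for illustration, not part of the prompt itself.

```python
# Marker that closes the prompt below; the transcript goes in its place.
PLACEHOLDER = "[Insert Transcript Here]"


def build_input(prompt: str, transcript: str, placeholder: str = PLACEHOLDER) -> str:
    """Return the prompt with the transcript placed at the end.

    Hypothetical helper: it only assembles text, it does not call any model.
    """
    transcript = transcript.strip()
    if placeholder in prompt:
        # Replace only the last occurrence of the marker, in case the
        # marker text also shows up earlier in the prompt.
        head, _, tail = prompt.rpartition(placeholder)
        return head + transcript + tail
    # No marker found: append the transcript after the prompt instead.
    return prompt.rstrip() + "\n\n" + transcript
```

Calling `build_input(prompt_text, transcript_text)` gives you one block of text that you can paste into the chat window as-is.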
Task:
Analyze a transcript to extract unique lessons and rarely expressed wisdom.
Objective:
Uncover specific insights and pieces of wisdom that are not commonly found in other sources. Ensure comprehensive clarity and preserve all essential and unique content without repetition or logical overlaps.
Key Focus Areas:
Comprehensive Clarity: Retain and clearly present all essential content.
Preservation of Unique Content: Identify and separately list any unique concepts or ideas.
Instructions:
Stage 1: Transcript Analysis
Thorough Review:
Action: Carefully read the entire transcript to fully understand the content and context.
Goal: Grasp the speaker's messages, tone, and underlying themes.
Identify Key Themes:
Action: Note down the primary themes, topics, and messages conveyed by the speaker.
Goal: Establish the foundational structure for analysis.
Extract Lessons Learned:
Highlight Key Lessons:
Action: Summarize the main lessons shared in the transcript.
Goal: Identify practical and applicable insights.
Ensure Practicality:
Action: Confirm that these lessons can be applied in real-world scenarios.
Goal: Enhance the usefulness of the extracted lessons.
Identify Rare Wisdom:
Focus on Uniqueness:
Action: Extract insights that are unique, unconventional, or rarely discussed.
Goal: Highlight novel ideas that add distinct value.
Nuanced Perspectives:
Action: Pay attention to nuanced or atypical advice provided.
Goal: Capture depth and sophistication in the speaker’s insights.
Avoid Repetition:
Eliminate Overlaps:
Action: Ensure that repeated ideas or concepts within the transcript are consolidated.
Goal: Prevent redundancy and maintain clarity.
Consolidate Information:
Action: Merge similar points into single, cohesive insights.
Goal: Streamline the extracted information.
Format Extracted Information:
Bulleted Lists:
Action: Present key lessons in a clear, bulleted list format.
Goal: Enhance readability and organization.
Separate Sections:
Action: Differentiate between common knowledge and unique wisdom in separate sections.
Goal: Distinguish widely known insights from novel ideas.
Provide Context:
Action: Include examples or contextual information to enhance understanding where necessary.
Goal: Offer clarity and practical relevance.
Verification:
Cross-Check Originality:
Action: Verify the uniqueness of the extracted insights against common sources.
Goal: Ensure the originality of the wisdom.
Validate Applicability:
Action: Ensure the rare wisdom is authentic and applicable.
Goal: Confirm the practical value of the insights.
Stage 2: Detailed Reporting
Select Compelling Details:
Action: Choose the most relevant and compelling details from Stage 1 without overlaps.
Goal: Focus on the most impactful insights.
Rank by Importance:
Action: Rank the extracted lessons and wisdom in terms of importance and logical hierarchy.
Goal: Prioritize insights based on their significance and interrelation.
Integrate Insights:
Action: Combine and integrate all insights into a single, cohesive narrative or procedure.
Goal: Create a unified and actionable framework without redundancies.
Structured Format:
Action: Present each unique insight along with its context and practical application in a clear, structured format.
Goal: Facilitate easy comprehension and implementation.
Stage 3: Final Consolidation
Holistic Review:
Action: Re-examine the entire transcript alongside the reports from Stages 1 and 2.
Goal: Gain a comprehensive and holistic understanding of the content.
Eliminate Redundancies:
Action: Remove any redundant information while retaining all unique details and key concepts.
Goal: Ensure the final output is concise and free of unnecessary repetition.
Implement Cross-References and Summaries:
Action: Use cross-references and iterative summaries to safeguard against the loss of essential content.
Goal: Maintain the integrity and completeness of the extracted information.
Organize into Clear Sections:
Action: Structure the output into well-defined sections with descriptive titles, each covering specific themes or concepts.
Goal: Enhance readability and ensure comprehensive clarity.
Stage 4: Final Refinement (If Needed)
Review for Missing Details:
Action: Compare the final output against the original transcript to identify any missing details that could enhance understanding.
Goal: Ensure all relevant information is included.
Enrich with Relevant Details:
Action: Incorporate additional relevant details from the transcript to preserve the original meanings of every idea.
Goal: Enhance the depth and richness of the final output.
Final Output Requirements:
Single Cohesive Block:
Action: Present the final output as one cohesive block of text, split into themes and unique ideas/concepts without references to processing stages.
Goal: Provide a seamless and integrated analysis.
Logical and Hierarchical Structuring:
Action: Ensure all information is organized logically and, if applicable, hierarchically.
Goal: Facilitate easy comprehension and application.
Separate Unique Concepts/Ideas:
Action: List any identified unique concepts or ideas separately, either within the main output or as an additional output if necessary.
Goal: Highlight novel insights distinctly.
Procedure for Learning:
Action: Present the extracted information in a natural learning format, making it concise yet comprehensive.
Goal: Enable effective learning while preserving all crucial information and unique wisdom.
Final Cleanup:
Eliminate Processing Stage References:
Action: Remove any mentions of Stage 1, Stage 2, Stage 3, Stage 4, or any other processing stages from the final output.
Goal: Ensure the output is a unified, clean review without internal processing references.
Ensure Logical and Hierarchical Structure:
Action: Verify that the final output is structured logically and hierarchically where applicable.
Goal: Maintain organization and clarity.
Usage Instructions:
Provide the Transcript:
Insert the transcript text after the prompt where indicated.
Execute the Prompt:
The system will process the transcript according to the above instructions, ensuring all unique and essential content is extracted and well-organized.
Review the Output:
The final output will present a clear summary of lessons learned, unique wisdom, and contextual information, structured logically for effective understanding and application.
TEXT FOR YOU TO WORK ON starts NOW:
[Insert Transcript Here]