DataTopics Podcast
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
Dive into conversations that should flow as smoothly as your morning coffee (but don't), where industry insights meet laid-back banter. Whether you're a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let's get into the heart of data, unplugged style!

#80 AI Agents Run Wild, DeepSeek Breaks Records, Polars Cloud Expands, and Perplexity Reinvents Search
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. DataTopics Unpluggedis your go-to spot for relaxed discussions on tech, news, data, and society.
This week, we’re unpacking everything from AI-powered vacations (or the lack thereof) to corporate drama, and even a deep dive into the quirks of COBOL. Join Morillo, Bart, and Alex as they navigate the latest happenings in data and tech, including:
- Airbnb AI: The CEO of Airbnb thinks AI trip planning is still a pipe dream. Is he right?
- Anthropic’s next AI model: A new Claude model could be just weeks away, promising a hybrid of deep reasoning and speed.
- OpenAI’s roadmap: Sam Altman lays out vague but ambitious plans, blurring the lines between AI models.
- Elon vs. OpenAI: Musk offers $97B for OpenAI, Altman claps back. Just another day in AI power struggles.
- RIP Viktor Antonov: The legendary art lead behind Half-Life 2 and Dishonored passes away at 52.
- Project Sid AI agents: 1,000 AI agents left to their own devices in Minecraft… What could go wrong?
- DeepSeek R1 breaks speed records: The latest AI model boasts a staggering 198 tokens per second.
- Perplexity’s Deep Research is now free: A game-changer for AI-powered search? We discuss.
- COBOL and the mystery of 1875-05-20: Why do old systems default to weird dates?
- Polars Cloud: A new distributed architecture to run Polars anywhere.
- Pickle AI avatars: Deepfake yourself into meetings. Ethical? Useful? Just plain weird?
- Vim after Bram: How the legendary text editor is surviving after its creator’s passing.
- Working Fast and Slow: A take on productivity, deep focus, and why some days just don’t work.
- We were wrong about GPUs: Fly.io admits they misjudged the demand for GPU-powered workloads.
#79 The $6 AI Model? France’s $85B Bet, DeepSeek's Censorship & The Python Upgrades You Need
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. DataTopics Unpluggedis your go-to spot for relaxed discussions around tech, news, data, and society.
Dive into conversations that should flow as smoothly as your morning coffee (but don’t), where industry insights meet laid-back banter. Whether you're a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let's get into the heart of data, unplugged style!
This week, we break down some of the biggest developments in AI, investments, and automation:
- France’s AI Boom: $85 billion in investments – A look at how a mix of international and domestic funds is fueling France’s AI ecosystem, and why Mistral AI might be Europe's best shot at competing with OpenAI.
- Anthropic’s AI Job Index: Who’s using AI at work? – A deep dive into the latest report on how AI is being used in different industries, from software development to education, and the surprising ways automation is creeping into unexpected jobs.
- The $6 AI Model: How low can costs go? – Researchers have managed to create a reasoning model for just $6. We unpack how they pulled it off and what this means for the AI landscape.
- AI Censorship & Model Distillation: What’s really going on? – A discussion on recent claims that certain AI models come with baked-in censorship, and whether fine-tuning is playing a bigger role than we think.
- PromptLayer’s No-Code AI Tools – Are no-code AI development platforms the next big thing?
- Predicted Outputs: OpenAI’s approach to efficient code editing – A look at how OpenAI’s "Predicted Outputs" feature could make AI-assisted coding more efficient.
- MacOS System Monitoring & Dev Tooling: The geeky stuff – A breakdown of system monitoring tools for Mac users who love to keep an eye on every process running in the background.
- Snapshot Testing with Birdie – Exploring the concept of snapshot testing beyond UI testing and into function outputs.
- BeeWare & the Python Ecosystem – A look at how BeeWare is helping Python developers build cross-platform applications.
- Astral, Ruff, and UV: Python’s performance evolution – The latest from Charlie Marsh on the tools shaping Python development.
#78 The AI Act Lands, Meta Pauses, OpenAI Complains & DeepSeek Rises
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Data Topics Unpluggedis your go-to spot for relaxed discussions on tech, news, data, and society.
This week, we’re joined by returning guest Tim Leers, who helps us navigate the ever-evolving landscape of AI regulation, open-source controversies, and the battle for the future of large language models.
Expect deep dives, hot takes, and a sprinkle of existential dread as we discuss:
- The EU AI Act and its ripple effects – What does it actually change? And is Meta pulling back on AI development because of it?
- Meta’s “Frontier AI” framework – A strategic move or just regulatory camouflage?
- OpenAI vs. the world – From copyright drama to OpenAI accusing competitors of using its models, is this just karma in action?
- DeepSeek and global AI competition – Why are government agencies banning it, and is it really a game-changer?
- The EU’s AI investment plans – Can Europe ever catch up, or is 1.5 billion euros just a drop in the compute ocean?
- OpenAI’s sudden love for open source – Sam Altman says they were on the "wrong side of history." Are they really changing, or is this just another strategic pivot?
- OpenAI’s latest tech update – we discuss Tim’s experience with o3 and show it live
All that, plus some existential musings on AI’s role in society, competitive dynamics between the US, EU, and China, and whether we’re all just picking our preferred bias in a world of competing LLMs.
Got thoughts? Drop us a comment or question—we might even read it on the next episode!
#77 DeepSeek R1: The ‘Open’ AI That’s Shaking Up OpenAI - Plus OpenAI’s Operator, Stargate, ByteDance, & more
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. DataTopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
This week, we’re joined by Jonas Soenen, a machine learning engineer at Dataroots, to break down the latest AI shakeups—from DeepSeek R1 challenging OpenAI to new AI automation tools that might just change how we use the internet. Let’s dive in:
DeepSeek R1: Open-source revolution or just open weights? – A new AI model making waves with transparency and cost efficiency. But is OpenAI really at risk?
Reinforcement learning, no tricks needed – How DeepSeek R1 trains without complex search trees or hidden techniques—and why that’s a big deal.
Web LM Arena’s leaderboard – How DeepSeek R1 ranks against OpenAI, Anthropic, and other top models in real-world coding tasks.
Kimi – Another promising open-weight model challenging the AI giants. Could this be the real alternative to GPT-4?
Open-source AI and industry reactions – Why are companies like OpenAI hesitant to embrace open-source AI, and will DeepSeek’s approach change the game?
ByteDance’s surprise AI play – The TikTok parent company is quietly building its own powerful AI models—should OpenAI and Google be worried?
OpenAI’s Stargate project – A massive $500B AI infrastructure initiative—how does this impact AI accessibility and competition?
OpenAI’s Operator: Your new AI assistant? – A browser-based agent that can shop for you, browse the web, and click buttons—but how secure is it?
Midscene & UI-TARS Desktop – AI-powered automation tools that might soon replace traditional workflows.
Nightshade – A new method for artists to poison AI training data, protecting their work from unauthorized AI-generated copies.
Nepenthes – A tool designed to fight back against LLM text scrapers—could this help protect data from being swallowed into future AI models?
AI in music: Paul McCartney vs. AI-generated songs – The legendary Beatle wants stronger copyright protections, but is AI creativity a threat or a tool?
📢 Note: Recent press coverage has clarified key details. Training infrastructure and cost figures mentioned were for DeepSeek V3—DeepSeek R1’s actual training costs have not been officially disclosed.
#76 AI at what cost? Environmental toll, Trump vs AI regulation, creative impact, & poisoned text for AI scrapers.
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
Dive into conversations that should flow as smoothly as your morning coffee (but don’t), where industry insights meet laid-back banter. Whether you’re a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let’s get into the heart of data, unplugged style!
This week, we dive into:
- The creative future with AI: is generative AI helping or hurting creators?
- Environmental concerns of AI: the hidden costs of AI’s growing capabilities—how much energy do these models actually consume, and is it worth it?
- AI copyright controversies: Mark Zuckerberg’s LLaMA model faces criticism for using copyrighted materials like content from the notorious LibGen database.
- Trump vs. AI regulation: The former president repeals Biden’s AI executive order, creating a Wild West approach to AI development in the U.S. How will this impact innovation and global competition?
- Search reimagined with Perplexity AI: A new era of search blending conversational AI and personalized data unification. Could this be the future of information retrieval?
- Apple Intelligence on pause: Apple's AI-generated news alerts face a bumpy road. For more laughs, check out the dedicated subreddit AppleIntelligenceFail.
- Rhai scripting for Rust: Empowering Rust developers with an intuitive embedded scripting language to make extensibility a breeze.
- Poisoned text for scrapers: Exploring creative ways to protect web content from unauthorized scraping by AI systems.
- The rise of the AI Data Engineer: Is this a new role in data science, or are we just rebranding existing skills?
#75 Developer Productivity in 2025: AI Replaces Engineers, Biden’s AI Chip Regulations, UV’s Killer Feature, and Doom in a PDF
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
In this episode, we delve into the big topics shaping our digital landscape:
- Car Expo - Brussels Motor Show: Highlights from Europe’s leading auto show, including Tesla’s Cybertruck debut and an innovative AI-powered car configurator that personalizes your vehicle experience.
- Biden Admin’s New AI Chip Export Rules: Exploring restrictions aimed at national security and their impact on global markets, with industry reactions from Nvidia.
- Meta and Microsoft’s AI Development Plans: From Meta’s goal to replace mid-level engineers with AI to Microsoft forming a dev-focused AI organization, we unpack their strategies and implications.
- Developer Productivity in 2025: How AI tools are changing workflows, boosting efficiency, and introducing new challenges.
- UV’s Killer Feature: Discover how ad-hoc environments are transforming development, courtesy of Lukas Valatka's insights.
- Doom in a PDF: Yes, you read that right—Doom running inside a PDF! Here’s the source code for all the geeks out there.
- Marimo: An exciting new project redefining collaborative development.
- AI and Everyday Life: A witty meme highlights AI’s direction—should it help with art and writing, or chores like laundry and dishes?
#74 Hello 2025! OpenAI’s O3, Deep Seek V3, Bolt.new and Doom Goes Artsy
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
Dive into conversations that flow as smoothly as your morning coffee (but don't), where industry insights meet laid-back banter. Whether you're a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let's get into the heart of data, unplugged style!
In this episode, we explore:
- OpenAI’s O3: Features, O1 Comparison, Release Date & more.
- Advent of Code: How LLMs performed on the 2024 coding challenges.
- DeepSeek V3: A breakthrough AI model developed for a fraction of GPT-4’s cost, yet rivaling top benchmarks.
- Shadow Workspace: How Cursor compares to Copilot with features like integrated models, documentation, and search.
- Bolt.new: Why it’s poised to revolutionize web app development with prompt-driven innovation.
- O1 Preview’s Chess Hack: When smarter means “cheater” in a fascinating experiment against Stockfish.
- Pydantic AI: A new tool bringing structure and intelligence to Python’s AI workflows.
- RightTyper: A tool to infer and apply type hints for cleaner, more efficient Python code.
- Doom: The Gallery Experience: A whimsical take on art appreciation in a retro gaming environment.
- Suno V4: The next-gen music generator, featuring "Bart, the Data Dynamo."
- Ghostty Terminal: The terminal emulator developers are raving about.
#73 LLM Hunger Games: The Ultimate Showdown - Rootsconf recap (Part 3)
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
Dive into conversations that flow as smoothly as your morning coffee (but don't), where industry insights meet laid-back banter. Whether you're a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let's get into the heart of data, unplugged style!
In this episode, we wrap up the Rootsconf mini-series with a thrilling finale with Sophie De Coppel and Warre Dreesen's workshop from our internal knowledge-sharing event:
- AI Hunger Games: A showdown between AI language models like GPT-4, Claude, and Gemini. Who aced coding, games, and social interactions?
- Human vs. Machine: Fun experiments like “Find the Human” and “The Chameleon Game” highlight where humans and AI shine—and stumble.
- Model Personalities Explored: Discover why some models seem nerdy, others boastful, and how creativity plays a role in performance.
- Engineering Insights: Behind-the-scenes on implementing and testing AI models in competitive scenarios, from advent-of-code puzzles to group chat debates.
Join the fun as hosts and guests break down the playful and thought-provoking ways we’re pushing AI to its limits. Let the games begin!
#72 Mastering Communication in the Workplace – Rootsconf Recap (Part 2)
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
Dive into conversations that flow as smoothly as your morning coffee (but don’t), where industry insights meet laid-back banter. Whether you're a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let's get into the heart of data, unplugged style!
In this episode:
Special guest Bram Decoster shares his journey and practical wisdom on developing charisma and confidence. We explore:
- The foundations of charisma: How presence, power, and warmth shape effective communication.
- Overcoming discomfort: Actionable strategies to tackle mental and physical barriers to confidence.
- Public speaking tips: Practical advice for managing nerves and connecting with your audience.
- Practical takeaways: Insights from "The Charisma Myth" by Olivia Fox Cabane, including visualization exercises and mindset shifts.
- Why charisma matters in data work: The intersection of technical expertise and interpersonal influence in the workplace.
#71 Navigating GenAI: How Organizations Must Adapt to Paradigm Shifts – Rootsconf Recap (part 1)
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
This week, we’re bringing you a special episode straight from RootsConf, our annual internal knowledge-sharing extravaganza! Hosts Murilo and Bart sit down with Tim and Ben, data strategy experts, for a lively chat about the state of generative AI as it transitions from a buzzword to a business tool.
Highlights from this episode:
- Generative AI adoption: Are companies finally moving beyond pilot purgatory?
- The environmental cost of AI: Can emerging techniques reduce its heavy energy footprint?
- Bridging the knowledge gap: What’s missing for widespread AI adoption in organizations?
- Future trends: How generative AI might reshape personalization and business processes in 2025.
Plus, we dive into the Gartner Hype Cycle and its relevance in understanding AI’s journey from innovation to disillusionment and beyond.
Get ready to dive deep into AI’s evolving role and its impact on industries, sustainability, and society. Hit play and join the discussion!
#70 What's Next for AI? A Recap of 2024 and Predictions for 2025
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
This week, Yannick joins the conversation for a lively year-end retrospective on the state of AI, data, and technology in 2024. Whether you're knee-deep in neural networks or just data-curious, this episode offers plenty to ponder.
Grab your coffee, sit back, and explore:
- AI’s meteoric rise in 2024: How GenAI went from hype to tangible business tools and what’s ahead for 2025.
- Strategic AI adoption: Challenges and best practices for embedding AI into workflows and decision-making processes.
- Real-time data: From dynamic pricing to e-commerce triggers, we explore gaps and future trends in event-driven infrastructure.
- The ethics and compliance puzzle: A dive into the EU AI Act, data privacy, and the evolving landscape of ethical AI usage.
- Developer tools and trends: Productivity boosters like Copilot and the rise of tools like PDM and Ubi in the Python ecosystem.
With reflections on everything from Lakehouse data platforms to open-source debates, this episode is the perfect blend of geeky insights and forward-looking predictions.
Pull up a chair, relax, and let’s dive into the world of data, unplugged style!
#69 From Engineer to CEO: Alex Gallego on Building Red Panda
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
In this episode, we’re joined by a special guest: Alex Gallego, founder and CEO of Red Panda. Together, we dive deep into building data-intensive applications, the evolution of streaming technologies, and balancing high throughput and low latency demands.
Key topics covered:
- What is Red Panda and why it matters: Red Panda’s mission to redefine data streaming while being the fastest Kafka-compatible option on the market.
- Batch vs. streaming data: An accessible guide to understanding the classic debate and how the tech landscape is shifting towards unified data frameworks.
- Scaling at speed: The challenges and innovations driving Red Panda’s performance optimizations, from zero-copy architecture to storage engines.
- AI, ML, and streaming data integration: How Red Panda empowers real-time machine learning and AI-powered workloads with ease.
- Open source vs. enterprise models: Navigating licensing challenges and balancing business goals in the hybrid cloud era.
- Leadership and career shifts: Alex’s reflections on moving from technical lead to CEO, blending engineering know-how with company vision.
#68 GenAI meets Minecraft, OpenAI’s O1 Leak, Strava’s AI Moves, HTMX vs. React & Octoverse Trends
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
Dive into conversations that should flow as smoothly as your morning coffee (but don’t), where industry insights meet laid-back banter. Whether you’re a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let’s get into the heart of data, unplugged style!
In this episode, we are joined by special guest Nico for a lively and wide-ranging tech chat. Grab your headphones and prepare for:
- Strava’s ‘Athlete Intelligence’ feature: A humorous dive into how workout apps are getting smarter—and a little sassier.
- Frontend frameworks: HTMX is a tough choice: A candid discussion on using React versus emerging alternatives like HTMX and when to keep things lightweight.
- Octoverse 2024 trends and language wars: Python takes the lead over JavaScript as the top GitHub language, and we dissect why Go, TypeScript, and Rust are getting love too.
- GenAI meets Minecraft: Imagine procedurally generated worlds and dreamlike coherence breaks—Minecraft-style. How GenAI could redefine gameplay narratives and NPC behavior.
- OpenAI’s O1 model leak: Insights on the recent leak, what’s new, and its implications for the future of AI.
- Tiger Beetle’s transactional databases and testing tales: Nico walks us through Tiger Style, deterministic simulation testing, and why it’s a game changer for distributed databases.
- Automated testing for LLMOps: A quick overview of automated testing for large language models and its role in modern AI workflows.
- DeepLearning.ai’s short courses: Quick, impactful learning to level up your AI skills.
#67 The AI Race: ChatGPT's New Web Search, Meta’s Llama AI Scaling Efforts & Python 3.13's Upgrades
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
Dive into conversations that should flow as smoothly as your morning coffee (but don't), where industry insights meet laid-back banter. Whether you're a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let's get into the heart of data, unplugged style!
In this episode, we cover:
- ChatGPT Search: Exploring OpenAI's new web-browsing capability, and how it transforms everything from everyday searches to complex problem-solving.
- ChatGPT is a Good Rubber Duck: Discover how ChatGPT makes for an excellent companion for debugging and brainstorming, offering more than a few laughs along the way.
- What’s New in Python 3.13: From the new free-threaded mode to the just-in-time (JIT) compiler, we break down the major (and some lesser-known) changes, with additional context from this breakdown and Reddit insights.
- UV is Fast on its Feet: How the development of new tools impacts the Python packaging ecosystem, with a side discussion on Poetry and the complexities of Python lockfiles.
- Meta’s Llama Training Takes Center Stage: Meta ramps up its AI game, pouring vast resources into training the Llama model. We ponder the long-term impact and their ambitions in the AI space.
- OpenAI’s Swarm: A new experimental framework for multi-agent orchestration, enabling AI agents to collaborate and complete tasks—what it means for the future of AI interactions.
- PGrag for Retrieval-Augmented Generation (RAG): We explore Neon's integration for building end-to-end RAG pipelines directly in Postgres, bridging vector databases, text embedding, and more.
- OSI’s Open Source AI License: The Open Source Initiative releases an AI-specific license to bring much-needed clarity and standards to open-source models.
We also venture into generative AI, the future of AR (including Apple Vision and potential contact lenses), and a brief look at V0 by Vercel, a tool that auto-generates web components with AI prompts.
#66 From Will Smith to Meta's MovieGen: How AI Video Got Real. Plus Claude 3.5’s “Computer Use” & Open Source Tools
Welcome to Datatopics Unplugged, where the tech world’s buzz meets laid-back banter. In each episode, we dive into the latest in AI, data science, and technology—perfect for your inner geek or curious mind. Pull up a seat, tune in, and join us for insights, laughs, and the occasional hot take on the digital world.
In this episode, we are joined by Vitale to discuss:
Meta’s video generation breakthrough: Explore Meta’s new “MovieGen” model family that generates hyper-realistic, 16-second video clips with reflections, consistent spatial details, and multi-frame coherence. Also discussed: Sora, a sneak peek at Meta’s open-source possibilities.
For a look back, check out this classic AI-generated video of Will Smith eating spaghetti.
Anthropic’s Claude 3.5 updates: Meet Claude 3.5 and its “computer use” feature, letting it navigate your screen for you.
Easily fine-tune & train LLMs, faster with Unsloth: Discover tools that simplify model fine-tuning and deployment, making it easier for small-scale developers to harness AI’s power. Don’t miss Gerganov’s GitHub contributions in this space, too.
Deno 2.0 release hype: With a splashy promo video, Deno’s JavaScript runtime enters the scene as a streamlined, secure alternative to Node.js.