Learn the differences between offline and online machine learning, how one can complement the other, and streaming concepts and best practices to start your online ML journey with River, an open source Python ML library, in this short talk by Tun Shwe at this year's Berlin Buzzwords. #bbuzz
We are thrilled to announce that Elastic is once again a Silver Partner for Berlin Buzzwords. Find out more about them on their website https://www.elastic.co/. #bbuzz
Join Ashish Khatkar at this year's Berlin Buzzwords for his talk on how Yelp has evolved its streaming data platform at scale while supporting its growing business needs coupled with the rapidly changing streaming landscape. #bbuzz
GenAI is rapidly transforming organisations. But how do we scale from individual experiments to enterprise-grade business products? Join Sebastian Arnold at this year's Berlin Buzzwords as he discusses insights from hundreds of use cases in the pharmaceutical industry on user intentions, information access patterns and quality metrics. #bbuzz
You've heard the wisdom of choosing boring technology, but how do you balance that with new usage patterns and ever-increasing scale? Join Varun Thacker and Bryan Burkholder at this year's Berlin Buzzwords to learn how Slack built a petabyte-scale log search engine with a team of just three engineers, supporting over a million queries per day. #bbuzz
We are proud to announce that Kleinanzeigen and mobile.de are once again our Gold Partners for Berlin Buzzwords. Learn more about them by visiting their websites https://www.mobile.de/careers/ & https://kleinanzeigen.de/careers/. #bbuzz
Join Hellmar Becker at this year's Berlin Buzzwords to learn how to track data lineage in a real-time, open source analytics pipeline. #bbuzz
The #CfP for MICES is only open for a few more days and ends this Sunday, April 28th, 23:59 CEST! Submit your proposal on all things e-commerce search now and visit MICES on June 12th, the day after #bbuzz! https://mices.co
Many of the supposedly novel challenges of building systems around LLMs are analogous to problems we've solved for conventional ML systems. Join William Benton at this year's Berlin Buzzwords to learn why the things you already know about building ML systems are still relevant to LLM systems - and where the real novelty of LLMs lies. #bbuzz
The Open Web Search Initiative (OWS.eu) is a European initiative providing an open platform to foster innovation in search and AI applications. At this year's Berlin Buzzwords, join @digitalpebble and Michael Dinzinger for an introduction to OWS, its public datasets, and two of the open source projects that underpin it - StormCrawler and URLFrontier. #bbuzz
Vector databases give you the infrastructure to store the embeddings, but how those embeddings are made is the most innovative part of all. Join Sonam Pankaj at this year's Berlin Buzzwords for her short talk on metric learning. #bbuzz
Like last year, we are delighted to introduce you to our Platinum Partner - OpenSearch Project! Find out more about their great work and the upcoming OpenSearchCon Europe 2024 in Berlin on their website. https://opensearch.org/
At this year's Berlin Buzzwords, join Kentaro Takiguchi for his talk on integrating semantic search into an established lexical search system, addressing potential challenges and pitfalls, and evaluating different optimisation methods and their varying effects on metrics by exploring and enhancing lexical and semantic search in practical scenarios. #bbuzz
This year at #bbuzz, Jarek Potiuk will discuss the role of orchestrators in the modern data stack and introduce the new orchestrator in town: "Apache Airflow 2.x", with ways of orchestrating you had not realised were possible!
Join @saahil at this year's Berlin Buzzwords and explore the evolving role of product management in open source generative AI, with a particular focus on the industry impact of Retrieval Augmented Generation (RAG). Learn about community-commercial balance, innovative AI product discovery, and sustainable monetisation strategies. #bbuzz
Join Daniele Antuzi at this year's Berlin Buzzwords to explore the architecture and implementation of a serverless MapReduce indexer designed for Apache Solr, but extendable to any search engine. Learn the principles of MapReduce, a programming model for processing large datasets, and how to adapt it for indexing documents in Apache Solr. #bbuzz
Curious about the unexpected places where deep learning is being used these days? Join Pere Urbon Bayes for his talk at this year's #bbuzz and learn how LSTMs, RNNs, Autoencoders or Transformers are being used to help teams analyse their opponents, make real-time decisions or even evaluate player performance in handball.
Dense vector search is not the only way to improve your search relevance. Join Hajer Bouafif and Praveen Mohan Prasad at this year's #bbuzz as they discuss advanced methods for improving keyword search using machine learning and large language models.
At this year's #bbuzz, join Joel Knighton for a speed run through the last ten years of research and development in approximate nearest neighbour search algorithms and vector databases, covering the major advances, the current state of the art, and possible future directions.
Curious about NLP beyond the startup hype? Join Emanuele Lapponi and Murhaf Fares at this year's #bbuzz and explore NLP in a 'traditional' setting. Tackle challenges such as data scarcity and domain specificity using, for example, data augmentation and zero-shot classification, and pick up tips and tricks for concrete and relatable NLP problems.