data-analysis
Airbnb
Mon Nov 25 2024
From Data to Insights: Segmenting Airbnb’s Supply
data-streaming
Booking
Building the Future of Content: Inside Booking.com’s Intelligent Content Enrichment Platform
Machine-Learning
Canva
How to improve search without looking at queries or results
How we improved Canva’s private design search while respecting the privacy of our community.
personalization
Wed Nov 20 2024
Building a User Signals Platform at Airbnb
engineering
Wed Nov 13 2024
Airbnb’s AI-powered photo tour using Vision Transformer
software-development
Tue Nov 12 2024
Self-Serve Platform for Scalable ML Recommendations
search-engines
Mon Nov 11 2024
Transforming Location Retrieval at Airbnb: A Journey from Heuristics to Reinforcement Learning
contextual-bandit
Expedia
Tue Nov 05 2024
Identifying Top-Scoring Arms in Ranking Bandits With Linear Payoffs in Real-Time
DDoS
Cloudflare
Wed Oct 23 2024
Training a million models per day to save customers of all sizes from DDoS attacks
In this post we will describe how we use anomaly detection to watch for novel DDoS attacks.
sagemaker
Wix
Tue Oct 22 2024
SageMaker Batch Transform Unleashed: My Journey at Wix to Achieve Scalable ML
Data
Spotify
Mon Oct 21 2024
How We Generated Millions of Content Annotations
With the fields of machine learning (ML) and generative AI (GenAI) continuing to rapidly evolve and expand, it has become [.
Pinterest
Fri Oct 11 2024
Ray Batch Inference at Pinterest (Part 3)
AI-and-ML
GitHub
Thu Oct 03 2024
How students teamed up to decode 2,000-year-old texts using AI
Students used GitHub Copilot to decode ancient texts buried in Mount Vesuvius, achieving a groundbreaking historical breakthrough.
Birthday-Week
Fri Sep 27 2024
AI Everywhere with the WAF Rule Builder Assistant, Cloudflare Radar AI Insights, and updated AI bot protection
reinforcement-learning
Thu Sep 26 2024
AI for Revolutionizing Customer Care Routing System at Wix
ai
Tue Sep 24 2024
Sandcastle: data/AI apps for everyone
pinterest
Fri Sep 20 2024
Feature Caching for Recommender Systems w/ Cachelib
CICD
Wed Sep 11 2024
Streamlining your MLOps pipeline with GitHub Actions and Arm64 runners
Explore how Arm’s optimized performance and cost-efficient architecture, coupled with PyTorch, can enhance machine learning operations, from...
reward-engineering
Netflix
Thu Aug 29 2024
Recommending for Long-Term Member Satisfaction at Netflix
mlops
Tue Aug 27 2024
Enabling Core Machine Learning Platform Capabilities
data-science
Mon Aug 26 2024
Improve Your Next Experiment by Learning Better Proxy Metrics From Past Experiments
machine-learning
Thu Aug 08 2024
NEP: Notification System and Relevance
Tue Jul 30 2024
How GitHub harnesses AI to transform customer feedback into action
Learn how we’re experimenting with open source AI models to systematically incorporate customer feedback to supercharge our product roadmaps...
experimentation
Fri Jul 26 2024
The Engineering Behind Booking.com’s Ranking Platform | A System Overview
Thu Jul 25 2024
Making WAF ML models go brrr: saving decades of processing time
In this post, we discuss the performance optimizations we've implemented for our WAF ML product.
research
Tue Jul 23 2024
Learning To Rank at Expedia Group: How to adapt the Property Search Result Page based on …
orchestration
Mon Jul 22 2024
Maestro: Netflix’s Workflow Orchestrator
Dropbox
Thu Jul 11 2024
Bringing AI-powered answers and summaries to file previews on the web
Wed Jul 10 2024
Building Pinterest Canvas, a text-to-image foundation model
fraud-prevention
Leverage graph technology for real-time Fraud Detection and Prevention
Bots
Wed Jul 03 2024
Declare your AIndependence: block AI bots, scrapers and crawlers with a single click
To help preserve a safe Internet for content creators, we’ve just launched a brand new “easy button” to block all AI bots.
Product-News
Mon Jun 24 2024
Using machine learning to detect bot attacks that leverage residential proxies
Computer-vision
Fri Jun 21 2024
How we see groups in design
eta-reliability
Lyft
Thu Jun 20 2024
ETA (Estimated Time of Arrival) Reliability at Lyft
video-editing
Wed Jun 19 2024
Video annotator: building video classifiers using vision-language models and active learning
Mon Jun 17 2024
Ray Infrastructure at Pinterest
learning-to-rank
Tue May 21 2024
Choosing the Right Candidates for Lodging Ranking
Fri Apr 26 2024
Airbnb Brandometer: Powering Brand Perception Measurement on Social Media Data with AI
Mon Apr 08 2024
Chronon, Airbnb’s ML Feature Platform, Is Now Open Source
LLM
Tue Mar 19 2024
Bye Bye Bye...: Evolution of repeated token attacks on ChatGPT models
Building on prior prompt injection research, we recently discovered a new training data extraction vulnerability involving OpenAI’s chat com...
statistics
Mon Mar 18 2024
Sequential Testing Keeps the World Streaming Netflix Part 2: Counting Processes
Tue Mar 12 2024
Lyft’s Reinforcement Learning Platform
Thu Mar 07 2024
Supporting Diverse ML Systems at Netflix
operational-efficiency
Mon Mar 04 2024
Evolving from Rule-based Classifier: Machine Learning Powered Auto Remediation in Netflix Data…
cloud-computing
PayPal
Wed Feb 21 2024
Leveraging Spark 3 and NVIDIA’s GPUs to Reduce Cloud Cost by up to 70% for Big Data Pipelines
Bot-Management
Fri Feb 16 2024
Monitoring machine learning models for bot detection
We recently shared an introduction to Cloudflare’s approach to MLOps, which provides a holistic overview of model training and deployment pr...
Thu Feb 15 2024
Safeguarding your brand identity: Logo Matching for Brand Protection
Brand Protection's Logo Matching feature enables users to upload an image of the user’s logo or other brand image.
platform
Tue Feb 13 2024
Powering ML Platform Orchestration and Experimentation
supervised-learning
Tue Jan 30 2024
Learning Embeddings for Lodging Travel Concepts
Thu Jan 18 2024
Handling Online-Offline Discrepancy in Pinterest Ads Ranking System
Tue Jan 09 2024
Evolution of Ads Conversion Optimization Models at Pinterest
Fri Dec 22 2023
Airbnb at KDD 2023
artificial-intelligence
LinkedIn
Wed Dec 20 2023
Enhancing Content Review: Proactively addressing threats with AutoML
Co-Authors: Shubham Agarwal and Rishi Gupta At LinkedIn, we work every day to deliver a safe and trusted experience for our members and cust...
Tue Dec 12 2023
Candidate Generation Using a Two Tower Approach With Expedia Group Traveler Data
Mon Dec 11 2023
Declarative Feature Engineering at PayPal
AI
Thu Dec 07 2023
ML Ops Platform at Cloudflare
Mon Nov 27 2023
Augmenting our content moderation efforts through machine learning and dynamic content prioritization
Co-Authors: Abhishek Chandak and Ritish Verma We recognize that our 1 billion members and their over 10 billion years’ worth of collective k...
creative-production
Sat Nov 25 2023
Causal Machine Learning for Creative Insights
Wed Nov 15 2023
Wisdom of Unstructured Data: Building Airbnb’s Listing Knowledge from Big Text Data
music-classification
Tue Nov 14 2023
Detecting Speech and Music in Audio Content
Algorithms
Fri Nov 10 2023
Ship Shape
Developer-Tools
Tue Nov 07 2023
How We Automated Content Marketing to Acquire Users at Scale
Spotify runs paid marketing campaigns across the globe on various digital ad platforms.
Mon Nov 06 2023
Building In-Video Search
Tue Oct 31 2023
Putting everything in its right place with ML-powered file organization
Smart move uses machine learning to analyze a user?s existing subfolder structure and suggest folders where they might want to move their fi...
Wed Oct 25 2023
Introducing Voyager: Spotify’s New Nearest-Neighbor Search Library
For the past decade, Spotify has used approximate nearest-neighbor search technology to power our personalization, recommendation, and searc...
Data-Science
Fri Oct 20 2023
Exclude from Your Taste Profile
What is “Exclude from your taste profile”? Are you a parent forced to put the Bluey theme song on repeat? Do you work from home and play lof...
Sat Oct 14 2023
Increasing Travelers’ Engagement Through Relevant Price Alerts at Expedia Group
model-inference
Speeding Up Inference Pipelines with Model Libraries at Expedia Group
Tue Oct 03 2023
Using Synthetic Search Data for Flights Price Forecasting
North-America
Mon Oct 02 2023
Career stories: The math-music connection in data science
When Javier signed up for a programming course during the pandemic, he had no idea that his career was about to shift from the world of musi...
Birthday Week recap: everything we announced — plus an AI-powered opportunity for startups
Fri Sep 29 2023
Privacy-preserving measurement and machine learning
Keyword-bidding
End-to-end Keyword Bidding for Apple Search Ads
Tue Sep 26 2023
Training Foundation Improvements for Closeup Recommendation Ranker
naming-conventions
Is this a date? Using ML to identify date formats in file names
multi-objective
Tue Sep 19 2023
The Juggler Model: Balancing Expectations in Lodging Rankings
ml-engineering
Thu Sep 14 2023
Expedia Group’s Customer Lifetime Value Prediction Model
Tue Sep 05 2023
MLEnv: Standardizing ML at Pinterest Under One ML Engine to Accelerate Innovation
distributed-systems
Thu Aug 17 2023
AVA Discovery View: Surfacing Authentic Moments
ChatGPT
Wed Jul 19 2023
Don?t you (forget NLP): Prompt injection with control characters in ChatGPT
feature-store
Tue Jul 11 2023
Chronon — A Declarative Feature Engineering Framework
Wed Jun 28 2023
Building Real-time Machine Learning Foundations at Lyft
Experimenting with Machine Learning to Target In-App Messaging
Messaging at Spotify At Spotify, we use messaging to communicate with our listeners all over the world.
Career stories: The power of an impactful mentor
Initially a Chicago-based data analyst, Jelanah had her heart set on a more meaningful career in frontend (UI) engineering.
Speed-Week
Thu Jun 22 2023
Globally distributed AI and a Constellation update
Today we’re announcing new Constellation features, explain why it’s the first globally distributed AI platform and why deploying your machin...
computer-vision
Tue Jun 20 2023
Detecting Scene Changes in Audiovisual Content
New Approaches For Detecting AI-Generated Profile Photos
Co-authors: Shivansh Mundra, Gonzalo Aniano Porcile, Smit Marvaniya, Hany Farid A core part of what we do on the Trust Data Team at LinkedIn...
Products
Docker
Full-Stack Reproducibility for AI/ML with Docker and Kaskada
Learn how Docker and Kaskada improve and accelerate the machine learning development cycle.
recommender-systems
Thu May 25 2023
Generating Diverse Travel Recommendations
Wed May 17 2023
Warden: Real Time Anomaly Detection at Pinterest
Engineering
How GitHub Copilot is getting better at understanding your code
With a new Fill-in-the-Middle paradigm, GitHub engineers improved the way GitHub Copilot contextualizes your code.
Thu Apr 27 2023
Humans + Machines: A Look Behind Spotify’s Algotorial Playlists
TL;DR Since 2017, Spotify has been working to create a better listening experience for our users by applying the expertise of our curators w...
monitoring
Tue Apr 25 2023
Building a large scale unsupervised model anomaly detection system — Part 2
big-data
Tue Apr 11 2023
Big Savings On Big Data
Thu Apr 06 2023
Our Learnings from the Early Days of Generative AI
It’s been an exciting few months at LinkedIn, as our engineering and product teams have been working hard to build some new and advanced AI-...
Mon Apr 03 2023
The Recommendation System at Lyft
Wed Mar 22 2023
Building Airbnb Categories with ML & Human in the Loop
lyft2vec — Embeddings at Lyft
Data-Driven-Segmentation
Thu Mar 16 2023
Understanding a Diverse User Base with Frequency Segmentation at Scale
How we developed a bespoke frequency-recency segmentation to understand our users' diverse usage patterns.
media
Tue Mar 14 2023
Building a Media Understanding Platform for ML Innovations
data-pipeline
Tue Mar 07 2023
Data ingestion pipeline with Operation Management
Thu Feb 16 2023
Prioritizing Home Attributes Based on Guest Interest
Mon Feb 13 2023
Scaling Media Machine Learning at Netflix
Infrastructure
Wed Feb 01 2023
Unleashing ML Innovation at Spotify with Ray
Introduction As the field of machine learning (ML) continues to evolve and its impact on society and various aspects of our lives grows, it ...
deep-learning
Mon Jan 30 2023
Learning To Rank Diversely
Powering Millions of Real-Time Decisions with LyftLearn Serving
clustering-algorithm
Discovering Creative Insights in Promotional Artwork
Thu Jan 26 2023
Scalable Annotation Service — Marken
Wed Jan 25 2023
Improving the customer’s experience via ML-driven payment routing
Co-Authors: Xianyun Mao, Stan Xu, Rachit Kumar, Vikas R, Xia Hong, and Divyakumar Menghani As a LinkedIn member, you can subscribe to Linked...
Tue Jan 24 2023
Deep Learning for Infinite (Multi-Lingual) Keywords
How we used a CLIP-inspired model to suggest keywords for template labeling in multiple languages.
models
Accelerating our A/B experiments with machine learning
software-engineering
Fri Dec 02 2022
Large Scale Ad Data Systems at Booking.com using the Public Cloud
Thu Nov 17 2022
Match Cutting at Netflix: Finding Cuts with Smooth Visual Transitions
Fri Nov 11 2022
New Series: Creating Media with Machine Learning
Machine Learning for Fraud Detection in Streaming Services
Search-and-Relevance
Wed Nov 02 2022
Search Pipeline: Part I
How we are rebuilding Canva's search stack and pipeline.