Analysis Report Beyond Toilet Paper - Essential Products with the Highest Demand Online A look at best selling categories and brands in the US from a basket of “high priority” goods
Analysis Report Shopping for Hand Sanitizers Online in the Age of COVID-19 On 4x price increases, rapid price fluctuations, upstart brands and fake listingsCOVID-19 has led to a spike in demand for products like hand sanitizers and face masks. Manufacturers of these products, many of
7 Elements of AI Product Strategy A playbook for building ML powered products, teams and businessesBuilding and selling machine learning (ML) products is hard. The underlying technology keeps evolving, requiring organizations to constantly be on their toes … and the
Predicting the Demand of Products Sold Online Can the demand for an item be predicted before its market launch? Which signals best prophesy future demand? We set out to answer the questions above using the ~10 billion price & demand signals in our database, and our Universal Product Catalog.
Introducing Attribute Extraction from User-Generated Content Attribute Extraction from ecommerce data - the generation of structured fields from unstructured text - is a popular product offering of ours. Our customers use it to improve the quality of their search
Analysis Report Patterns in Shipment Delays of Imports into the USA This is the second in a multi-part series in which we attempt to unearth dynamics of the shipping industry, by analyzing publicly available import shipment data in the United States.Vessel delays can
Analysis Report Shipping Line, Port and Route Dynamics of US Shipment Imports This is the first in an article series in which we attempt to unearth dynamics of the shipping industry, by analyzing publicly available import shipment data in the United States.In this article,
Using AI to Automate Web Crawling Classification, clustering and reinforcement algorithms used at Semantics3 to automate the crawling of ecommerce websites
Weak Supervision & Active Learning - Essential Tools for Machine Learning Projects Weak supervision and active learning - a couple of key underutilized ideas for when the performance of your ML algorithms plateau, and the marginal utility of further investment in the project drops
The Ecommerce Knowledge Graph - Semantics3 Labs An Ecommerce knowledge graph built from Semantics3's Universal Product Catalog. Useful for exploring supply chain insights, brand profiles, compatible looks and more. Made possible by our crawling and extraction engines.
AI-based HTS Code Classification: 5 Technical Ideas for Building Solutions that Work Imports, exports and tariffs are quite the theme in the news these days, be it in the context of Brexit, the US-China trade war or the Iran nuclear deal. Executive decisions on what
Analysis Report The State of Ecommerce - 2019 Report We analyzed 138 million dotcom sites, with a focus on ~6 million ecommerce sites. Here's what we found.
Principles for Managing Data & Product Quality Measure everything, set realistic targets, Think statistically and 6 other principles for managing product/data quality
Mapping the Universe of E-commerce Brands Of the thousands of attributes that we handle while curating product catalogs, the hardest and perhaps most important attribute is brand. Consumers often begin their searches with brand names of the products they're
How we do Data QA @ Semantics3: Processes & Humans-in-the-Loop (Part 2) In this second of two posts about data quality, I'd like to delve into the challenge of building and maintaining evolving datasets, i.e., datasets that are function of changing inputs and fuzzy
How we do Data QA @ Semantics3: Statistics & Algorithms (Part 1) In this first of two posts about data quality, I'd like to delve into the challenge of building and maintaining evolving datasets, i.e., datasets that are function of changing inputs and fuzzy
How to Launch and Maintain Enterprise AI Products There is a significant disconnect between the perception and reality of how enterprise AI products are built.The narrative seems to be that given a business problem, the data science team sets about
Deriving Meaning through Machine Learning: The Next Chapter in Retail Three slides from Benedict Evans' brilliant talk, The End of the Beginning, really caught my attention.The Old and the NewAcross industries, machine learning is helping us get to successive levels of meaning
5 Lessons I’ve Learned Tackling Product Matching for E-commerce Slides & video from my talk @ Fifth Elephant
The GIGO Principle in Machine Learning And its implications for PMs, designers, salespeople and data scientists
“Hot Dog and a Not Hot Dog”: The Distinction Matters And why Periscope Should’ve Held Out for a Little Longer
Questions & Intuition for Tackling Deep Learning Problems Working on data-science problems can be both exhilarating and frustrating. Exhilarating because the occasional insight that boosts your…
engineering Debugging Neural Networks: A Checklist You’ve framed your problem, prepared your datasets, designed your models and revved up your GPUs. With bated breath, you start training…
engineering Geosearch Support for Limestone (Node.js Sphinx Connector) Limestone, a Node.js connector to Sphinx search server, is a quite handy library written by Serge Shirokov.
engineering How We Built an iOS App, an Android App and a Node.js API in 20 Hours My typical app development workflow involves brainstorming, idea formation, validation, feature selection, design iteration and eventually…