Eric Landau is the CEO & Co-Founding father of Encord, an energetic studying platform for laptop imaginative and prescient. Eric was the lead quantitative researcher on a world fairness delta-one desk, placing hundreds of fashions into manufacturing. Earlier than Encord, he spent practically a decade in high-frequency buying and selling at DRW. He holds an S.M. in Utilized Physics from Harvard College, M.S. in Electrical Engineering, and B.S. in Physics from Stanford College.
In his spare time, Eric enjoys enjoying with ChatGPT and huge language fashions and craft cocktail making.
What impressed you to co-found Encord, and the way did your expertise in particle physics and quantitative finance form your strategy to fixing the “information downside” in AI?
I first began eager about machine studying whereas working in particle physics and coping with very massive datasets throughout my time on the Stanford Linear Accelerator Heart (SLAC). I used to be utilizing software program designed for physicists by physicists, which is to say there was quite a bit to be desired when it comes to a nice consumer expertise. With simpler instruments, I might have been capable of run analyses a lot sooner.
Later, working in quantitative finance at DRW, I used to be liable for creating hundreds of fashions that have been deployed into manufacturing. Much like my expertise in physics, I discovered that high-quality information was essential in making correct fashions and that managing complicated, large-scale information is troublesome. Ulrik had the same expertise visualizing massive picture datasets for laptop imaginative and prescient.
Once I heard about his preliminary concept for Encord, I used to be instantly on board and understood the significance. Collectively, Ulrik and I noticed an enormous alternative to construct a platform to automate and streamline the AI information improvement course of, making it simpler for groups to get the most effective information into fashions and construct reliable AI methods.
Are you able to elaborate on the imaginative and prescient behind Encord and the way it compares to the early days of computing or the web when it comes to potential and challenges?
Encord’s imaginative and prescient is to be the foundational platform that enterprises depend on to remodel their information into purposeful AI fashions. We’re the layer between an organization’s information and their AI.
In some ways, AI mirrors earlier paradigm shifts like private computing and the Web in that it’s going to turn into integral to workflows for each particular person, enterprise, nation, and business. In contrast to earlier technological revolutions, which have been largely bottlenecked by Moore’s regulation of compounded computational development of 30x each 10 years, AI improvement has benefited from simultaneous improvements. It’s thus shifting at a a lot sooner tempo. Within the phrases of NVIDIA’s Jensen Huang: “For the very first time, we’re seeing compounded exponentials…We’re compounding at one million instances each ten years. Not 100 instances, not a thousand instances, one million instances.” With out hyperbole, we’re witnessing the fastest-moving expertise in human historical past.
The potential right here is huge: by automating and scaling the administration of high-quality information for AI, we’re addressing a bottleneck stopping broader AI adoption. The challenges are harking back to early-day hurdles in earlier technological eras: silos, lack of finest practices, limitations for non-technical customers, and a scarcity of well-defined abstractions.
Encord Index is positioned as a key software for managing and curating AI information. How does it differentiate itself from different information administration platforms presently accessible?
There are a couple of ways in which Encord Index stands out:
Index is scalable: Permits customers to handle billions, not tens of millions, of knowledge factors. Different instruments face scalability points for unstructured information and are restricted in consolidating all related information in a company.
Index is versatile: Integrates instantly with non-public information storage and cloud storage suppliers reminiscent of AWS, GCP, and Azure. In contrast to different instruments which are restricted to a single cloud supplier or inner storage system, Index is agnostic to the place the information is positioned. It allows you to handle information from many sources with acceptable governance and entry controls that enable them to develop safe and compliant AI functions.
Index is multimodal: Helps multimodal AI, managing information within the type of photos, movies, audio, textual content, paperwork and extra. Index isn’t restricted to a single type of information like many LLM instruments at the moment. Human cognition is multimodal, and we consider multimodal AI might be on the coronary heart of the following wave of AI developments, which is able to supplant chatbots and LLMs.
In what methods does Encord Index improve the method of choosing the correct information for AI fashions, and what influence does this have on mannequin efficiency?
Encord Index enhances information choice by automating the curation of huge datasets, serving to groups determine and retain solely essentially the most related information whereas eradicating uninformative or biased information. This course of not solely reduces the scale of datasets but additionally considerably improves the standard of the information used for coaching AI fashions. Our prospects have seen as much as a 20% enchancment of their fashions whereas reaching a 35% discount in dataset measurement and saving a whole lot of hundreds of {dollars} in compute and human annotation prices.
With the speedy integration of cutting-edge applied sciences like Meta’s Section Something Mannequin, how does Encord keep forward within the fast-evolving AI panorama?
We deliberately constructed the platform to have the ability to adapt to new applied sciences shortly. We give attention to offering a scalable, software-first strategy that simply incorporates developments like SAM, guaranteeing that our customers are at all times outfitted with the most recent instruments to remain aggressive.
We plan to remain forward by specializing in multimodal AI. The Encord platform can already handle complicated information varieties reminiscent of photos, movies, and textual content, in order extra developments in multimodal AI come our approach, we’re prepared.
What are the commonest challenges firms face when managing AI information, and the way does Encord assist deal with these?
There are 3 foremost challenges firms face:
- Poor information group and controls: As enterprises put together to implement AI options, they’re usually met with the fact of siloed and unorganized information that’s not AI-ready. This information usually lacks sturdy governance round it, limiting a lot of it from being utilized in AI methods.
- Lack of human consultants: As AI fashions deal with more and more complicated issues, there’ll quickly be a scarcity of human area consultants to organize and validate information. As an organization’s AI calls for enhance, scaling that human workforce is difficult and dear.
- Unscalable tooling: Performant AI fashions are very data-hungry when it comes to information wanted for fine-tuning, validation, RAG, and different workflows. The earlier technology of instruments isn’t outfitted to handle the quantity of knowledge and sorts of information required for at the moment’s production-grade fashions.
Encord fixes these issues by automating the method of curating information at scale, making it simple to determine impactful information from problematic information and guaranteeing the creation of efficient coaching and validation datasets. It makes use of a software-first strategy that’s simple to scale up or down as information administration wants change. Our AI-assisted annotation instruments empower human-in-the-loop area consultants to maximise workflow effectivity. This course of is especially essential in industries reminiscent of monetary providers and healthcare, the place AI trainers are expensive. We make it simple to handle and perceive all of a company’s unstructured information, lowering the necessity for handbook labor.
How does Encord deal with the problem of knowledge bias and under-represented areas inside datasets to make sure honest and balanced AI fashions?
Tackling information bias is a essential focus for us at Encord. Our platform routinely identifies and surfaces areas the place information could be biased, permitting AI groups to deal with these points earlier than they influence mannequin efficiency. We additionally be sure that under-represented areas inside datasets are correctly included, which helps in creating fairer and extra balanced AI fashions. By utilizing our curation instruments, groups will be assured that their fashions are skilled on various and consultant information.
Encord not too long ago secured $30 million in Sequence B funding. How will this funding speed up your product roadmap and growth plans?
The $30 million in Sequence B funding might be used to drastically enhance the scale of our product, engineering, and AI analysis groups over the following six months and speed up the event of Encord Index and different new options. We’re additionally increasing our presence in San Francisco with a brand new workplace, and this funding will assist us scale our operations to help our rising buyer base.
Because the youngest AI firm from Y Combinator to lift a Sequence B, what do you attribute to Encord’s speedy development and success?
One of many causes we’ve got been capable of develop shortly is that we’ve got adopted a particularly customer-centric focus in all areas of the corporate. We’re continuously speaking with prospects, listening intently to their issues, and “bear hugging” them to get to options. By hyper-focusing on buyer wants somewhat than hype, we’ve created a platform that resonates with high AI groups throughout varied industries. Our prospects have been instrumental in getting us to the place we’re at the moment. Our capability to scale shortly and successfully handle the complexity of AI information has made us a horny resolution for enterprises.
We additionally owe a lot of our success to our teammates, companions, and traders, who’ve all labored tirelessly to champion Encord. Working with world-class product, engineering, and go-to-market groups has been enormously impactful in our development.
Given the growing significance of knowledge in AI, how do you see the position of AI information platforms like Encord evolving within the subsequent 5 years?
As AI functions develop in complexity, the necessity for environment friendly and scalable information administration options will solely enhance. I consider that each enterprise will finally have an AI division, very like how IT departments exist at the moment. Encord would be the solely platform they should handle the huge quantities of knowledge required for AI and get fashions to manufacturing shortly.
Thanks for the good interview, readers who want to be taught extra ought to go to Encord.