Full Transcript
Andrew Weinreich:Ā On several occasions, Iāve had conversations with computer algorithms. My guess is that many of you have as well, although you might not have realized it at the time.
It starts out the way it always does: you email someone you want to meet, and they respond, copying their assistant, Amy, to set up the date. Everything goes exactly as expected.
In another context, I might have thanked the assistant for his or her help. But for this conversation it wasnāt necessary: Ā because, the assistant named Amy wasnāt a real person. Ā She was a bot ā powered by artificial intelligence. I knew that because it was revealed in her signature.
X.ai is the company thatās developed Amy, and her male counterpart, Andrew. Theyāve raised $44 million to develop AI that will take scheduling out of human hands. X.ai Co-Founder and CEO Dennis Mortensen explains how it works:
Dennis Mortensen:Ā It works very similar to that of you having hired a person to sit in your front office and manage your calendar. Meaning that when somebody emails me and say, āHey, Dennis. Do you got time to meet up for coffee?ā I can simply just within my inbox, reply back and say āHey, you know what? Iām up for coffee. I have CCed in Amy at x.ai and she can help put something on my calendar in the next couple of weeks for half an hour at 200 Broadway. Then I click send. And then, I click archive. Because this is not my job anymore. And Amy will now understand what I asked her to do, remove me from the dialogue, reach out to one or more participants, negotiate a time at my office for half an hour, and upon conclusion, send out an invite.
āI click send. And then, I click archive. Because this is not my job anymore.ā
Andrew Weinreich:Ā Dennis envisions a world where for every basic task like scheduling, weāll call on an AI assistant that can summon up other, more specialized AI assistants, to do those tasks for us. But what makes scheduling so complicated that it takes millions of dollars in funding to build this technology?
Dennis Mortensen:Ā Because people donāt say, āAmy, comma, can you set up a new meeting?ā No, they say things like, āHey, Amy, Lars and I need to the hokey pokey next week, can you make it happen?ā Hokey pokey? As in, who is this guy? What is he talking about? Well, we still need an algorithm to understand that.
Andrew Weinreich:Ā And to train an algorithm to understand and take action on this takes a lot of human-powered work.
Dennis Mortensen:Ā If you asked me where a very large amount of our funds went, it will be into labeling. As in, we have today about a hundred people full time that does nothing but labeling.
Andrew Weinreich:Ā Hereās how itās done.
Dennis Mortensen:Ā You could imagine really that I gave you a piece of paper and four highlighters. And I said, āI need you to highlight all of the intents with the blue highlighter. And I need you to add a note what intent that was? Was that a cancel meeting, reschedule meeting, running late, add participant, make Andrew optional, and so on and so forth. I need you to use the yellow highlighter to mark any dates and times. I need you to use the pink highlighter to highlight any location. Did he say 200 Broadway? Did he say New York? Did he say Chelsea?ā ā¦That type of labeling we obviously just do in a digital form. And the more you do that, the more we can get to build a corpus for where we have an understanding of how do people ask to set up a new meeting?
Andrew Weinreich:Ā Iām Andrew Weinreich.
Jeremy Levy:Ā And Iām Jeremy Levy. Youāre listening to Deciding by Data, the podcast that brings you into the C-Suite to uncover how data powers successful businesses. Today on the show, youāll hear from Dennis Mortensen, the Danish serial entrepreneur who hopes the next time you schedule a meeting, you say, ānot my problem.ā
Andrew Weinreich:Ā Dennis, thanks for joining us.
Dennis Mortensen:Ā Thanks much for having me.
Andrew Weinreich:Ā Would love it if we could start with a little bit of background on you. And then, talk about your company. Maybe you can tell us what youāve been doing, prior to x.ai. And then, lead us into the inspiration behind the company.
Dennis Mortensen:Ā Sure, I will kind of spare you the four-hour seminar on just how fantastic I am. Iāll leave you the number for my mom, and she can do that version. But, the short version is one for where weāve now spent a good, 23 years trying to extract value from data. So, our venture before this was a predictive analytics venture for media. And the venture before that, was a enterprise web analytics company. And the venture before that, was a [inaudible] analysis company so thereās certainly some fondness of data, and if anything I could do the next eight hours on how to extract value from data.
Andrew Weinreich:Ā Where are you from originally?
Dennis Mortensen:Ā Oh, the funny accent? Iām Danish. Or as all my American friends will call me the socialist. So, thereās another two hours we can talk about.
Andrew Weinreich:Ā And the first company was started in Denmark orā¦?
Dennis Mortensen:Ā So, we did our first venture out of Denmark. We did the next one out of Denmark, then we did one out of Budapest. I spent four years there. So, if you ever want to talk about sales skills, try to persuade your wife and kids to move to Eastern Europe. Before it joined the European Union by the way.
Andrew Weinreich:Ā Wow.
āIāve turned entrepreneurship into a lifelong career. As in, I have no backup plan. As in, this is exactly what I want to do.ā
Dennis Mortensen:Ā And the next one was in New York. And we are in New York today as well. So, Iām a fan ā Iām so much a fan that I became a citizen. What is it now? A year and a half ago, just before you guys took a turn to the right, and ā nothing wrong with that ā but, Iām certainly very most impressed with just the willingness to participate in any form of entrepreneurship. So back home, certainly when I grew up, so, Iām 45, sadly, because Iām getting older. If you told people that my career choice is one of entrepreneurship, it really sounded like, āOh, so you werenāt able to find a really half decent job, so your backup plan is try to make some money for yourself,ā which was not the story. The story was one for where I think I have an idea so good, I can go out and create a company around it. Anybody here, which I speak to, if you say the word entrepreneurship, that is kind of on par with you saying, āIām a fucking rock star,ā as in I play the guitar but I donāt play the guitar but I do tech. And thatās I think is something I certainly appreciate, because Iāve turned entrepreneurship into a lifelong career. As in, I have no backup plan. As in, this is exactly what I want to do. And weāve done it now five times over.
Andrew Weinreich:Ā Yeah. Good for you. Good for you. Tell us a little about the inspiration behind x.ai.
āI did 1,019 meetings in the year 2012. What was even sadder? I saw that I did 672 reschedules.ā
Dennis Mortensen:Ā The true story, and itās going to sound like one of those eBay-like made up stories. But, the true story is that, in our past venture, post-exit having sold it, what happens is you end up with a little bit extra time on your hand as you do the integration. And one day, I did this kind of odd, sad thing of going back into my calendar to see exactly how many meetings did I do the year prior to having sold the company, just for the fun of it. And what I ended up seeing was, that I did 1,019 meetings in the year 2012. What was even sadder? I saw that I did 672 reschedules, as in, I set up all those bloody meetings myself. No assistant. No help. Just me in my underwear at home, at 11 PM. And at that moment, I thought, āYou know what? That is a task Iāve done for now 20 some odd years. Am I supposed to do that for the next 20 years?ā As in, that doesnāt sound like a version of the future I wanted to live in. And if that wasnāt a version of the future I wanted to live in, what might that look like? And when you speak to people, we all had the same idea of what the future might look like. Either you win in the corporate lottery and you become SVP of whatever, and you get a human assistant, and you donāt have to do this chore anymore, or tech somehow solves it. And you get some agent in place who could do it for you. And it just felt like there might be an opening in late 2013, to go build that agent. So, that was the catalyst. And at that moment, it certainly looked like there might be an opening for somebody to go out and build this intelligent agent, that could do this particular task.
Andrew Weinreich:Ā Was that the moment where you said this is an opportunity to use artificial intelligence, or were you focused on: Is it possible that we can use a human agent to solve this problem?
Dennis Mortensen:Ā So, as an entrepreneur, Iām actually less focused on the technology. I think any entrepreneur ā and Iām obviously biased on my own processes ā but, I think you should less onto the pain. Because the pain, if true and honest, never change. The technology might change actually over time such that, you hate your commute. Then we can try to solve that for decades by giving you a car, probably even kind of half a century. But then, at some Ā point, that particular technology actually doesnāt really work anymore. And then you want something else. But, the pain is still true. Which is that, you hate your commute.
And this is the same. Which is that, if I asked you: āDo you like setting up meetings?ā And youāll tell me āNo Dennis, I f***ing hate it.ā āOh, if you hate that, how can we then figure out how to remove it?ā And I think thatās what we latched onto. But, removing it is one for where the pain is so obvious, that a ton of people have tried to remove it before. And at least to me, it looks like they all walked down the same avenue, which is what they did when they implemented some extension or some plugin, or you Tungle.me, or you Doodle me, or you do whatever. And all good honest people, smart people who try to solve it. But, Iām not looking to have the pain alleviated a little bit. I want it gone. And it certainly looked like the only way to completely remove it was to hand over that job to somebody else. But, handing over to somebody else, if thatās a human, thatās just a very expensive product. As in youāre going to pay 50K a year to have a human sit in your front office and do this for you. So, it canāt be a human. It needs to be something else. Some sort of machinery.
āIām not looking to have the pain alleviated a little bit. I want it gone.ā
Andrew Weinreich:Ā Before we drill down into how x.ai works, Iām curious whether you try to quantify this problem. I read a blog post. I think by your co-founder, that 87 million Americans spend five hours, I think it was five hours a week scheduling. How dramatic or profound did you find this problem? Did you think that if you could change scheduling, you could increase productivity of people, and therefore, literally impact our gross productivity? Or did you think about this more as a corporate expenditure challenge or opportunity? How did you think about this problem and quantify it?
Dennis Mortensen:Ā Youāre certainly right that any entrepreneur, whether just in your kind of challenge of pitching it to yourself, or pitching it to your investors or co-founders, or team members would have to come up with some sizing of market. As in, am I selling this for me and my mom? Or is it something a little bit bigger out there? The funny thing about this particular venture, and any one of our past ventures have had some sort of a damn slide in our pitch deck. As in, āDennis, tell me exactly what is the total addressable market for this thing youāre trying to solve?ā For this venture, funny enough, that slide didnāt even come into question. It wasnāt even part of our deck. As in everybody just knew inherently that this is massive, as in we could do a study right now, go down to Broadway and ask the first 20 knowledge workers; one, do you touch a computer in one way shape or form this week? Theyāll say yes. Do you do a meeting? They will then say yes. Do you like setting up? They will then say no. And that just holds true in any form of survey you would do.
And what we found is, when we started to kind of look into the size of it is that, it was massive. Massive to the kind of, if somebody solved it, forget about whether we solve it, if somebody comes along and solves it, thereās a real visible impact, on if not just domestic, but global productivity growth. As in this is something which isnāt a blip. Weāre talking hours and hours on, at least in the US about short of 90 million US knowledge workers. Thatās certainly something that was strong when we had to go pitch this for funding that people immediately saw that if now is the time, the market is massive. Just like you havenāt read a single story, probably in any publication where somebody suggests, the market for self driving cars, weāre not sure how big it is. We just all assume that it is massive to the extent for we probably donāt even have to talk about it. We have to talk about the technology and is it even kind of feasible at this moment in time, or should we wait kind of a couple of decades? This was very much the same.
Jeremy Levy:Ā So, the term AI is really popular these days. And I find itās really helpful, could you help us understand, what does AI even mean?
Dennis Mortensen:Ā Thatās a very good question. And given that, thereās people out there taking their degrees in AI, we should probably have some sort of fixed explanation for exactly what it is. I like the idea though of AI being some mechanism, thatās got the ability to do a job or a single task autonomously. Meaning, that thereās something here which if I donāt do it, you would do it, or somebody else would do it. And it becomes AI when you can kind of hand over that task or job to some piece of machinery, whether that be setting up a meeting for me, driving the car for me, finding that three faces in that picture uploaded to Facebook. But, doing a job that a human would otherwise have to do.
Jeremy Levy:Ā I mean, there are so many examples of where technology make decisions on our behalf. Where does the sort of line go between like a simple, if statement, and sort of branches to do one thing versus another, versus actual intelligence? Where is that threshold? Where is that line?
Dennis Mortensen:Ā So, I certainly like the artificial part of artificial intelligence. Given that, we are not even entirely sure what true intelligence is all about. As in, we donāt yet have some paper that came out a decade ago that concluded that human intelligence is this. And now, itās just a matter of time before we can figure out how to replicate that. As in, we just donāt know. So perhaps we should focus on the artificial part. As in, thereās some decision-making here in the machine that we do understand, that can do a task and a job that if it didnāt do it, you would have to do it. So, that is something which I keep, at least personally, latching onto. And I actually donāt mind the simplicity in some of the technology. As in, for me, that can still be artificial intelligence. And we still have this, in particular with artificial intelligence, conundrum for where once we saw something, and it used to be sexy yesterday upon having solved it, itās just not sexy tomorrow. And then, we forget how interesting that problem was.
Andrew Weinreich:Ā Let me just drill down on Jeremyās question. Because we had, for example, online dating services. Thatās a machine we had, instead of a in-person matchmaker, thatās a machine that answered a query and that did a personās job. But, we wouldnāt call that artificial intelligence. So, when Jeremy says, āWhereās the line?ā, there seems to be a higher level of intelligence than simply a machine doing a personās job and so Iād love it if we could push you a little further, and say how would you define- Weāre seeing it, sort of like blockchain, right? We see every company, if you want to be a sexier company today, no matter what youāre doing, youāre saying āIām in blockchain.ā And we have that same dynamic with AI. And itās not clear to me what the definition is youāre offering of AI thatās distinct from machines handling human tasks, that theyāve been doing for decades and decades and decades.
Dennis Mortensen:Ā Letās unpack it a little bit more. So, I certainly think thereās some value in making the distinction between the conversational UI and the agents that exist within that conversational UI. So, we now have this moment in time for where a new UI is about to arrive. So, I took my CS degree on the command line, and that was the UI. The only access to compute was, if you could figure out the syntax on the actual command line. Then, we got the graphical user interface and we can kind of semi-democratize access to compute. And reasonable people within other verticals could get access to it. So if youāre in CPA, you can get access to a spreadsheet, and you can now use compute. And then perhaps the mobile use interface is a UI paradigm in its own right.
But, thereās this moment now for where people are moving into this conversational UI, for where, at least in my opinion, we will fully democratize access to compute. As in anybody who can speak even a funny Danish-English sentence like me will get access to compute. Well, thatās an interface. That is just you wanting to speak to your computers instead of clicking a dropdown, or a checkbox, or a radio button. But, that to me is not intelligence. That is just a new UI.
Now, when you get the ability to speak to your computer, what happens is that, you will be less focused on tasks and more focused on objectives. As in, I want something done. I actually donāt care how you do it. I just want something done. And those agents within this new UI, I think are AI agents. They donāt have to be. They can also just be humans. Iāll give you a good example here for where I think this new UI paradigm really starts to kind of make sense.
So, one of my favorite examples is this: Iāll have to go to San Francisco on the 25th of January. And as I go there, Iāll stay at the Hilton. It will be 11 PM when I arrive, and I want a Diet Coke. My first line of thought, is not one for where I want to use the existing kind of UIs. As in āOh, Diet Coke, let me go to the AppStore, let me find the Hilton app, let me install the Hilton app on my iPhone, let me set up a new account if you donāt have an Hilton account, let me log in with my new credentials. Upon logging in, let me find the store, let me add Diet Coke to basket, let me click checkout.ā As in, that just doesnāt work, which is why the phone and the conversational UI still works. I pick it up. I click room service. I say, āHey, can I get a Diet Coke for 1920, please?ā That though, is a human agent on the other end of that phone call. But perhaps, itās just an Alexa in the room, or a big-ass number on the front door, I can text the downstairs lobby and say, āHey guys, can you bring up a Diet Coke?ā
When they receive that message, thatās where I think AI will start to come into play. As in, it might just be too expensive to have a human agent read and interpret that message and figure out what action to take, perhaps in most of the cases, in most lobbies, in most hotels, itās simple, ācan you get me a Diet Cokeā requests, and a machine should just take that over. And thatās where I think it will become somewhat easy to just make the distinction between humans and AIs, where they will read the request, and they will take the action.
The sophistication of some of those agents will be super simple, and weāll be embarrassed to call them AIs. Iāll give you one of my favorite bots right now, outside of x.ai, which is Citibank, and we can have a laugh about that. But they used to call me, āHey, Dennis. I can see that you spent three dollars, but itās in Singapore. Did you buy that bagel?ā āUh, yeah. First of all, I think Iām paying AT&T three dollars just for you to call me right now. So, why are you trying to confirm this?ā But, that was the mechanism which they put in place, they had a human agent called me to verify a specific buy outside of my normal pattern. Now, they just text me. Reply one back if true. Reply two back if not true. Damn, thatās an awesome bot. As in, I hated that kind of, āI think Citibank is calling me.ā I didnāt have all the numbers for Citibank. It was just kind of one of those calls where it could be them. I think I have to pick it up.
Jeremy Levy:Ā Help us understand what are the other components that fall into sort of this AI bucket? So, one is obviously the interface. You talked then a little bit about sort of the actionability around it. What else is required to make an AI around scheduling meetings? Help us understand the other components involved.
Dennis Mortensen:Ā Sure. So, we certainly like to see it as a kind of three-pronged challenge, which consists of one, the NLU, which is the Natural Language Understanding, or your ability to read the message and understand it in full. Say that, weāve set up a meeting for today, and you email Amy and say, āHey, Amy. Iām going to be running five minutes late.ā That happens all the time, Iām sure you can imagine that. But, if I understand it, which is a big āifā. That is super hard, given that language is not a solved science, and the only hope you have is to pick a vertical so thin, that you might be able to solve language within that kind of vertical. But, thatās the first challenge, the read challenge. Now, we make the assumption that you did understand it in full.
Jeremy Levy:Ā Is all of that automated today is just the entire understanding from a natural language perspective automated?
Dennis Mortensen:Ā Iāll tell you about how we got about to actually solving. So, thatās the first challenge. You need to understand it. But, letās say I do understand it. Youāre running five minutes late. Not as in, I was able to pick up the temporal data, and I was able to pick up the intent or the location, or whatever. As in I truly understood it. Then, I need some sort of reasoning engine. As in, youāre running five minutes late. What does that mean? What do I do with that? You and me might just think thatās common sense. You probably do nothing. As in, five minutes, all good, stay tuned. Iāll just wait if you drive a little bit later. But, there is no such thing as common sense. You have to kind program for that. So weāve had, and anybody building any agent, we have to kind of build some sort of reasoning engine within their very confined universe.
Jeremy Levy:Ā Meaning, in the context of scheduling, we can make certain assumptions about what people mean when they say five minutes late.
Dennis Mortensen:Ā So, we have no aspiration, or are not as naive to believe that we can build a human-like agent here. What we can build perhaps, if weāre good, is some agent that can exist within a meeting scheduling universe. That means, if you email Amy and say, āHey, you know what? I actually think Chelsea is going to win the Premier League this year.ā She will have no idea. They will, but she will have no idea. But, if you do something within her universe, sheāll understand everything about it.
Andrew Weinreich:Ā When you say that you have no aspiration of expanding beyond scheduling, I assume you mean in the next couple years. Because Iām guessing that at some point, this idea of the challenge of language and establishing context, or understanding context regardless of the topic is a solvable problem.
Dennis Mortensen:Ā Iām not really a huge fan of people trying to do seven things. Especially if youāre a startup, you can barely do one thing. So, being kind of half-assed at seven things doesnāt really get you far.
Andrew Weinreich:Ā I got it for you, but we hear from Elon Musk that we have to be afraid of artificial intelligence. So Iām asking you, are you saying that the challenges associated with language and understanding context, are such that, weāll never solve this problem? Itās a 20-year problem? When I say we, Iām talking about understanding language, notwithstanding that you are operating in one vertical or there is not the context for knowing what vertical youāre in.
Dennis Mortensen:Ā I am certainly not convinced that we will get some sort of press release from the Googles of the world in five years that will say, āHey, tada, weāve solved it.ā We now have an agent for where any question you have, weāll have an answer. Any job which you can imagine, will be able to do that job. As in, I cannot, even in my wildest dreams get to that end state. What I do think will happen though, is that youāll see thousands of these highly verticalized, highly specialized agents that can do one job, but do that one job really, really well.
Jeremy Levy:Ā So, does that mean I have to have an agent for every specific nuanced task that I want to get accomplished?
Dennis Mortensen:Ā Given that I donāt believe in the single agent scenario, that this oracle will arrive one day. As in I canāt play out that scenario. I could be wrong, but it doesnāt ring true to me. What I think will happen is, which you allude to here, but not in the kind of setting for where, āHey, Dennis. I donāt want to hire 22 single individual agents and try to figure out how to work them.ā I do think youāll end up doing just that though. As in, I think youāve done it already. So, pick up your iPhone and look at it. What you did is that, at least on average, people have a little bit short of 100 apps on their phone. That is you picking, really, just within the existing UI paradigm, about 100 apps that you need to go about your day. I need this to search, this to email, this to calendar-
Andrew Weinreich:Ā But weāll look back, you know, 10 years from now, weāll look back, and weāll say, āThat was the craziest experience in the world, right?ā That in order for me to figure out the ferry that I was going to take here, I had to look at the ferry app. In order for me to look at the calendar, or my schedule, I have to look at the calendar app. So, I think weāre trying to understand your perspective. Is there a router in the future? Is that Siri? Is that Google Home? What does the dynamic of the future look like? This doesnāt strike me as sustainable, that I have 100 agents, all that develop this level of AI, and that perfect the intelligence, but, itās a clumsy experience.
āā¦weāll end up with one core enabler AI, or you can call it a horizontal AI like Siri, Alexa, Cortana, and so on and so forth. And it will be your job to figure it out, which specialized agents do I want to employ, which my enabler AI, will then have to manage on my behalf.ā
Dennis Mortensen:Ā See, I think itās going to play out exactly as you suggested, which is that weāll end up with one core enabler AI, or you can call it a horizontal AI like Siri, Alexa, Cortana, and so on and so forth. And it will be your job to figure it out, which specialized agents do I want to employ, which my enabler AI, will then have to manage on my behalf. So that, when you say, āHey, Siri. Can you get Dennis and I together the first week of February when Iām back in Manhattan, please?ā Siri needs to understand that, that is a job for which I donāt have the skills to solve it. But, what I do know is that, we got somebody on payroll who knows how to solve that. Let me package the request, send it over to Amy. She can work it over the next day and a half. Once sheās solved it, she should then package it, and go back to Siri and say, āHey, remember that thing you gave me on Monday? I now solved it. And you can go back to your boss and say so.ā And that I find really interesting. I find it interesting not only because I think it provides a very similar setting to the kind of app economy, which we have today, that will allow startups like ours to exist. Because we are highly specialized in doing this kind of one thing and doing it really, really well. But what will change then is that, your responsibility as an employee, or just as a good knowledge worker is one for where you would have to pick who is my foreman, or my enabler AI? And which agents do I want to hire? How do I want to train them, and how do I make sure that they become better and better? Which suggests, at least in my version of the future here, some setting for where we all become managers. As in, you will have to learn how to manage agents.
Andrew Weinreich:Ā But how does that work? I mean, youāve raised I think $60 million, is that right?
Dennis Mortensen:Ā About 45.
Andrew Weinreich:Ā $45 million. Most of the apps on my phone have not raised $45 million. In fact, I would say a few have. And Iām trying to understand the level of sophistication thatās necessary to bring an AI application to dominate a vertical ā calendaring in this case ā seems like extraordinary. So, Iām trying to figure out how that plays out. Siri is hitting the APIs of these different functional components. And how many companies will there be like you that are capable and properly capitalized to play a role in this emerging world?
Dennis Mortensen:Ā If the App Store is any predictor for where today, youāll see about two and a half million some odd apps in the Apple App Store, and a similar amount in Googleās store. I find it hard to believe-
Andrew Weinreich:Ā Most of those were built with $20,000.
Dennis Mortensen:Ā Correct. And I think the same will be true here. And some were built with hundreds of millions of dollars. As in, some particular games have hundreds of people working, years on end trying to bring that to market. And I think you will see the same type of power law applied to these agents. Where some will be extremely sophisticated. Some will be middle of the road. And some will just be two guys hacking away over the weekend on some free API, some predictors from AWS, and good for them. It will be so specialized, that it really only works for 800 people at some high school in Denmark. And theyāll be fine for that particular agent, it only really solves the, when you have a half hour free in your schedule in that high school in Copenhagen.
Jeremy Levy:Ā You mentioned AWS for a second. What happened when the company was founded that allowed x.ai to happen now? How was the sort of AI technological ecosystem evolved, such that the opportunity for x.ai is now? And whatās going to enable that in terms of what you just described in terms of the App Store of individual specific agents to make that easier in the future?
āIf you go engineer a self-driving car, you canāt have a footnote saying, āHey, by the way, for every 1000 miles, weāre going to hit a pedestrian.ā As in, even though that will be fantastic software ā if you can drive a thousand miles in Manhattan and only hit one pedestrian, Iām pretty amazed, as in, thatās awesome. Itās probably not going to be commercially viable.ā
Dennis Mortensen:Ā And by the way, youāre seeing this App Store happening right now. So, If you go to the Alexa ecosystem, what you see is that they have what they call a Skill Store, which is very similar to the narrative I just provided here. So, itās not that Iām a single individual whoās super crazy. As in, Amazon thinks my version of the future is going to play out as well. But, in regards to the entrepreneurs and their access, I think you might have to, at least, in a simplistic way, look at AIs in two ways. AIs that live in some high-accuracy setting and AIs that live in some low-accuracy setting.
And itās not a matter of you always want to be in a high-accuracy setting. As in, thereās plenty of applications where you donāt need high-accuracy. If you go engineer a self-driving car, you canāt have a footnote saying, āHey, by the way, for every 1000 miles, weāre going to hit a pedestrian.ā As in, even though that will be fantastic software ā if you can drive a thousand miles in Manhattan and only hit one pedestrian, Iām pretty amazed, as in, thatās awesome. Itās probably not going to be commercially viable. So, that needs to be extremely accurate. We happen to be in the high-accuracy space as well for where, if I didnāt turn up today for this particular meeting, thatāll be just not cool. Not cool to the extent for where, we probably couldnāt exist as a commercial viable product in market.
But, thereās plenty other applications where you donāt need that. Right now, take a picture of the four of us in here. Upload that to Facebook. And they might just only recognize two of us. Thatās 50 percent accuracy. Thatās low, as in. Thatās hitting a pedestrian for every 100 meters. But, it doesnāt matter. Itās one of those where like, āOh, nice. Thanks, Facebook.ā You might even just go click the other two faces. You might not- as in, thatās a low accuracy thing for where youāre actually fine with that. And you need to probably pick one of those two, or understand which one you land in with the product youāre trying to bring to market. I think thereāll be plenty of low-accuracy products for where, you and me tonight could spin up a set of instances on AWS, collect a little bit of data, use their set of prepackaged services-
Jeremy Levy:Ā Is that the enabler though? The fact that we can sort of spin up, compute power sort of at a whim? Is there software that is available now publicly that wasnāt previously available? What are the building blocks that have allowed x.ai to exist today? Whereas we had conversational interfaces on IM, you know, 12 years ago, 15 years ago, 20 years ago, I think.
Dennis Mortensen:Ā Taking a few steps back, I think what people perhaps donāt fully appreciate and what we kind of left out of this kind of half-technical discussion is one for where the very first challenge is one for where, if you want to go attack a vertical, you need to be able to walk up to the white board and describe that universe in full. Not part of it. Not most of it. So, if you donāt describe it in full, how can you ever create an agent that can navigate that space?
Andrew Weinreich:Ā So, give us another vertical. Give us another small example of a universe described briefly in full, other than calendaring.
Dennis Mortensen:Ā Business travel for direct flights, something for where, I do what, 20 trips a year, I want a fair price but Iām not price sensitive and I really just want to go from here to San Francisco, and I need to be there for my meeting at 6 oāclock. End of request. Whoās the agent? Me. That means I go to Expedia. I go do all the searches. I pick JetBlue, or United, or whatever. I donāt have any loyalty. And then, I find the best time. I then go book it. But really, there was a time when we had human agents, called travel agents. And I could say, āHey, can you get me to San Francisco next Thursday by 3 PM so I can be at my meeting at 4 for less than $800, please.ā And then they would just make it happen. Or doing my receipts. There was a time for where, you wouldnāt be sitting in your desk kind of half crying, figuring out how to get $5.40 reimbursed from Pret, now, you might take little pictures, but youāre still doing it. There was a time some people just take all the receipts, give it to somebody in the office. They would sort through it. So, thereās plenty of those. But, you need to be able to map out that universe in full. As in, what exactly do I mean when I say meeting scheduling? The good thing about my particular vertical is that, we are almost in instant agreement about what that means. When we say business travel, direct flights, we are certainly quite close. When we talk about receipts being reimbursed, we might have different opinions. Equally good opinions about what it is, but different opinions. But, you need to be able to map that out.
Andrew Weinreich:Ā But we shouldnāt have different opinions, right? The IRS prescribes, I mean [Dennis laughs] ā presumably, you should be able to incorporate some standard for that.
Dennis Mortensen:Ā But, we might have different opinions about where the pain is embedded within this particular task. There are certain parts of it that I donāt care about. There are certain parts that I really care about. And the first thing, once you mapped that out, then you need to figure out, now, that I know the universe what my agent needs to exist in, what data can I then collect so I can create some model, which can kind of replicate decisions in this space? Where does that data come from? How are you going to collect it? And how are you going to label it? And those two kind of baby steps turns out to be perhaps, really the barrier to entry for anybody who wants to do anything in the AI space. And they sound so unsexy that we barely want to talk about it. But, that is really where I think you win or lose. Can you map out your space and can you collect some data that represents all agents navigating around in this space? We spent, just to give you some sort of scope here, four years with the majority of the team, by the way, just labeling data. It is so unsexy that I want to cry.
Andrew Weinreich:Ā What does that mean, labeling data?
āā¦all of that data needs to be labeled, so that becomes understanding so that hopefully in the not too distant future, you can make a decision that a human would otherwise have made by reading this text.ā
Dennis Mortensen:Ā Labeling data means that you shoot Amy an email saying, āHey, you know what? Iām going to be running five minutes late.ā Letās just text, unstructured data, that doesnāt mean anything. For that to have any value, we need to label it. As in, you could label it for an intent called ārunning late.ā It could be labeled for āreschedule.ā If you said, āYou know what? I donāt think Iām going to make it. Perhaps we can do early next week.ā Then I need to label for the reschedule intent. But I also need to kind of label for some temporal data because you said early next week. And all of that data needs to be labeled, so that becomes understanding so that hopefully in the not too distant future, you can make a decision that a human would otherwise have made by reading this text.
Andrew Weinreich:Ā Are you labeling that data relative to me? Or youāre labeling that data to try to understand an entire new dictionary of-?
Dennis Mortensen:Ā Entire new dictionary. Good question. The reason that we are not labeling it distinctly to you is that, for meetings, you might, if you are as aggressive as me, do a thousand meetings a year.
Andrew Weinreich:Ā Well, I actually meant more, if I tell you Iām five minutes late, what I really mean is Iām 10.
Dennis Mortensen:Ā Yes.
Andrew Weinreich:Ā Otherwise I wouldnāt write you, right?
Dennis Mortensen:Ā Yes.
Andrew Weinreich:Ā I would just show up Iām five minutes late. So, Iām trying to understand when we talk about AI and calendaring, it would seem to me that the words people use mean something different relative to the person whoās using them.
Dennis Mortensen:Ā So, the way we thought about this is certainly one for where we think the data is just too sparse if we go down to the individual. As in, youāre not running late so often, that I would really have a large data set on you running late. You might be running late 40 times over the next year. Iām just picking a random number here. Thatās 40 data points. Thereās not much modeling youāre going to do in 40 data points. So, we have to do it network-wide. However, what we are very likely to do is when we move this to not a new language, but a new culture, because we might say this should exist in Japanese or Italian, itās not so much the fact that we need to exist within a new language, itās that we now exist within a new culture for where thereās a different understanding of running late in Italy or Japan, Iām most sure. Well I can certainly, Iāll give you a funny story here, which is true and itās a feature that we used not to have. So, where Iām from, Northern Europe, we donāt double or triple confirmed meetings, which is very American. So, this idea that we set up a meeting for March 10th. Then three weeks prior, āHey, Dennis. Just checking in. Iāll see you on March 10th.ā Then the day before, āHey, Dennis. Iāll see you tomorrow at 1:00.ā
Andrew Weinreich:Ā Iām assuming youāre going to stand me up. Thatās the American culture.
āā¦weāve had to engineer part of the reasoning engine to adapt to the culture.ā
Dennis Mortensen:Ā Hereās the funny thing. For every one of those I get Iām thinking, āYeah, I know. Itās on my f***ing calendar.ā And the day before, āI know. This the third time you emailed me on this meeting.ā As in, you and me could set up a meeting for December 10th this year. I wouldnāt call you. I wouldnāt email. I wouldnāt do nothing. Iād just turn up at your office on December 10th as weāve agreed. Thatās part my culture. We though have engineered it ā and Iām the only Danish dude around the office, so it doesnāt matter what my culture is. So, weāve had to just live in this environment where people do double and triple confirm. So, Amy does the same now. As in, she will, if we set up the meeting way back, go out and do that very American thing for where, āHey, guys. I know we set this up some time ago. Just confirming everything set for tomorrow. I assume itās all good. If not, do let me know.ā And that is something where weāve had to engineer part of the reasoning engine to adapt to the culture, which kind of brings us back to where you started, that we canāt do it on the individual. We probably can do it on the culture.
Jeremy Levy:Ā Are those the biggest challenges that you have left for x.ai?
Dennis Mortensen:Ā I would say coming back to full circle where you and I started, I named two of the three challenges. I said thereās the NLU challenge, for where we need to understand what is being asked of us. We need a reasoning engine to take the action. And lastly, we need the NLG engine, some natural language generation where I can write back because the reasoning engine will be give me sort of computational outcome which I need to turn into language so that my human constituents can understand whatās going on. The real challenge honestly, is not on the natural language generation end. That weāve solved in full. Weāll continue to perfect it. The reasoning engine, the intelligence that Amy have today is at such a level for where she can operate in a fully autonomous setting, doesnāt need any kind of guidance from humans on the understanding end. We are still and might even forever fight that. Weāre just willing today, if we donāt understand, to send out kind of like a Siri-like message, which is when Siri doesnāt understand, sheāll do the whole, āI donāt have the answer, but hereās 10 pages from the internet that you might want to look at,ā which is-
Jeremy Levy:Ā Worthless.
āā¦if humans were just honest and not lying all the time, the whole thing would just be so much easier. But they are not honest. They lie all the time.ā
Dennis Mortensen:Ā Yeah, not good. We have something similar which is that, āI didnāt really understand that. Could you please clarify?ā Iād obviously, rather be in a setting where I never had to send out that email. But given that we are in this kind of fully kind of mechanical setting, for where we need the machine to just do the task on its own, that is where we keep fighting. And really what we keep fighting here is human ambiguity. And I keep telling this anybody whoās willing to listen, if humans were just honest and not lying all the time, the whole thing would just be so much easier. But they are not honest. They lie all the time. Theyāre f***ing crazy. But we canāt change that. We have to live within that universe where they do exist.
Iāll give you just a simple example here. Say, weāve been talking and weāve been super busy throughout the day. So, tonight at 1:00 AM you email me saying, āDennis, I think I figured it out. How about we get together first thing tomorrow morning and hash things out?ā Happens all the time, right? Itās not true though. Itās not what you want. At 1:00 AM, you donāt want tomorrow. You want today. Itās just that humans use the word tomorrow all the way up to the point where they go to bed not the midnight. Now, the machine has to do two things. One, be super confident at high accuracy, that I picked all the data up as it were. Two, equally confident and say, āI know what you said, but itās not what you mean. So, Iām going to change it for you.ā That is dangerous territory to be in. And itās not this one here. It could be āI wonāt make the meeting tomorrow,ā but you have two meetings set up. Which one do I cancel?ā What negative ramifications are going to come along with me cancelling the wrong meeting? As in, people might be traveling in for this meeting. And that is just really difficult. Weāre working it though, but really difficult. And to answer your question, human ambiguity and the fact that people are crazy. But we have to live with that. And all agents have to kind of live with the fact that is just how humans are.
Jeremy Levy:Ā Maybe humans need to learn a way to interact with agents.
Dennis Mortensen:Ā I hear you, and I got mixed emotions on that. On bad days for where I get punched in the face on Amy having made some sort of mistake, I want to go back and say, āHow about you man up a little bit, and really, just speak like a machine.ā
Jeremy Levy:Ā Be nice to your AI.
Dennis Mortensen:Ā Yeah. Be nice to your AI, right? And then, on other days, I have this idea that I donāt think itās really my job to change my mom, who really just wants for this agent to go deliver some bagels for tomorrow morning. āHey, why didnāt I get those bagels? Why are they coming kind of the day after where I donāt need any bagels. Iām not even home now.ā I got mixed emotions, so I go round in circles here. I would hope though that we as an industry end up in a setting for where we donāt have to change humans too much.
Andrew Weinreich:Ā Are you integrated into Alexa through the Skills?
āā¦almost all of your external meetings happened over email, almost all of your internal meetings could happen over a platform like Slack.ā
Dennis Mortensen:Ā We are not. The two dimensions we are going to expand on is one into other languages, so, we can attack more markets, and into other communication channels, such as Alexa. First one weāre going to do though is Slack. Just because Alexa is sexy and itās a version of the future, kind of Star Trek levels, but itās not really where you set up meetings. So, what weāve seen is that almost all of your external meetings happened over email, almost all of your internal meetings could happen over a platform like Slack.
Andrew Weinreich:Ā But Slack is still very much a corporate product.
Dennis Mortensen:Ā Correct.
Andrew Weinreich:Ā You know itās funny. With my Alexa, I asked my Alexa in the morning what do I have today, because Iām integrated with my Google Calendar. And I love the idea of scheduling meetings from my Alexa. But, you donāt see that as- and I loved your vision that Alexa would be this gatekeeper for all these requests. How far off are we from that? I mean I could imagine, Iām going through the examples you gave: āAlexa, can you schedule these meetings?ā āAlexa, Iāve got a trip coming up.ā In fact, we wouldnāt even want to ask Alexa. Alexa just knew I had a trip coming up and routed the question. How far are we from that vision?
Dennis Mortensen:Ā I donāt think we are very far away from that vision. So, weāve certainly tried to engineer our agent in such a way that it is not an English email agent, but is an agent that can schedule meetings that happens to exist on email and only be able to speak English today.
Andrew Weinreich:Ā But your vision of Alexa routing, how far are we from-?
Dennis Mortensen:Ā That is so close that I might even suggest that itās happening as we speak on some very simple agents. And that means when I want to reorder my Seamless Thai setting, I use Alexa. But thatās because Iām willing to go through something which is a little bit clunky, just for the fun of it. But itās not so clunky that it is not useful. So, I use Alexa for where I open up the Seamless Skill and I reordered the same set of Thai food with me and [inaudible] gets once every two weeks. And that works quite well. And that kind of brings me one baby step into your version of a future.
Andrew Weinreich:Ā Itās your version of your future, but x.ai, Alexa, am I one year away? Two years away?
DennisĀ Mortensen:Ā In the not too distant future. So,we have grand aspirations of you being able to wake up Amy and/or Andrew on any communication channel. As in, if you talk about meetings, the next thing you should think about is I need to wake up Amy because this is not my job. And if you get an email, you reply back, CC her in. And if you get a Slack message, you āatā her and she makes sure that she takes it over. If you get a text message from one of your daughters, you add in her number. And if youāre in an office or at home, you should be able just to wake her up on Alexa as well. The way we prioritize those is really just by how many meeting requests or moments do you end up talking about meetings in that particular channel. Weāve seen that, if we ask our customers what they ask about the most is: one, Slack; two, text messages.
Andrew Weinreich:Ā Tell us quickly just about the limitations of a solution like Calendly and then tell us whether there are others like a company like Fin that have a contextless approach to taking on all verticals at the same time.
Dennis Mortensen:Ā So, thereās a whole pool of solutions like Calendly out there and weāve seen them before and Calendly, just a nicer version of Tungle.me, really. And thereās nothing wrong with that, and in many scenarios for where everything is normal, they might even work just fine. I just donāt like the idea that for some meetings, I can use this solution. For others, I cannot. I like the idea that I reached a moment in my life for where setting up meetings is no longer a job I do. And that means that many of these kind of fixed web interface type settings break if things out of the ordinary. How do you tell a web interface that youāre running late? How do you tell a web interface that Iām going to bring my colleague? How do you tell the web interface that I want to extend it half an hour? How do you tell a web interface that I think we need to change the location? How do you do really all those things that are just very human? But it doesnāt mean that it does not work. It works when things are normal and within the confinements of what a web interface can do. I just donāt think that is what I need. I donāt need to alleviate the pain. I need it to disappear. And suddenly, our bet is one for where we reached that moment for where an agent can be built for where it can disappear in total.
Andrew Weinreich:Ā Just to close, tell us about whether thereās this opposing view to the one you articulated before, that a company like Fin, or maybe there are others that have sort of a contextless approach we can solve all types of problems?
Dennis Mortensen:Ā Again, I donāt think thereās anything wrong with that. Iām just more a fan of finding certain verticals for where at least, thereās this possibility for where you can solve it in full. Smart people have taken the approach that it might not be solvable. As in, weāll attack multiple verticals and theyāre not machines solvable. Weāll apply machine learning and make it more efficient. And then, weāll have humans kind of sit back, and verify, and participate, but thatās really just outsourcing V2. And again, you can create a $20 million business on that. And you should do so. And weāve seen that many times over. And I think it can create a great service. But, what you see is whenever you apply humans into the equation, it becomes a luxury. As in, as far as I believe, Fin charges $60 an hour, and theyāll take 10,15 minutes to set up a meeting. And all of a sudden, it canāt be for everybody. It can be for people for where, Iām willing to pay that and my invoice now is $700, and youāre okay with that.
But I really like this idea that me giving you an email, if we end up working together, you wonāt see that as a luxury. That is seems like āWhy wouldnāt you?ā As in, Iām going to get an email, a calendar, a laptop, a keycard to the office. As in, thatās just necessities to do reasonable decent job. I want Amy and Andrew and this whole idea of setting up meetings ā itās not a luxury. Thatās just a given. As in, why would I hire somebody at a six figure salary and then say, āHey, by the way, I want you to do email pingpong at the office trying to set up meetings with our customers, or leads, or prospects, or whatever that might be, candidates, and so on and so forth.ā No, I donāt. If youāre a recruiter, I want you to speak to our candidates. If youāre a sales person, I want you to speak to our leads. If you are an account manager, I want you to speak to our customers. The setting up the meetings doesnāt add any value.
Andrew Weinreich:Ā Dennis, before we break, any numbers you can give us about the number of people using x.ai?
Dennis Mortensen:Ā Those arenāt public just yet but I can certainly tell you that we are setting up hundreds of thousands of meetings. But the funny thing is, even as proud as I am of hundreds of thousands of meetings as in real people are meeting as we speak because Amy and Andrew kind of set up the meeting, it is still us having pretty much zero percent penetration. So, thereās about 10 billion formal meetings being set up in the US alone every year. Meaning that if I do a million meetings, Iām at zero percent. As in, that is awesome, isnāt it? Thereās just so much to attack here.
Andrew Weinreich:Ā Dennis, thank you.
Jeremy Levy:Ā Thank you so much. Really appreciate it.
Dennis Mortensen:Ā Cheers.