AI founders will learn the bitter lesson

lukaspetersson.com

311 points by gsky 19 hours ago


CharlieDigital - 18 hours ago

There's only one core problem in AI worth solving for most startups building AI powered software: context.

No matter how good the AI gets, it can't answer about what it doesn't know. It can't perform a process for which it doesn't know the steps or the rules.

No LLM is going to know enough about some new drug in a pharma's pipeline, for example, because it doesn't know about the internal resources spread across multiple systems in an enterprise. (And if you've ever done a systems integration in any sufficiently large enterprise, you know that this is a "people problem" and usually not a technical problem).

I think the startups that succeed will understand that it all comes down to classic ETL: identify the source data, understand how to navigate systems integration, pre-process and organize the knowledge, train or fine-tune a model or have the right retrieval model to provide the context.

There's fundamentally no other way. AI is not magic; it can't know about trial ID 1354.006 except for what it was trained on and what it can search for. Even coding assistants like Cursor are really solving a problem of ETL/context and will always be. The code generation is the smaller part; getting it right requires providing the appropriate context.

jonnycat - 18 hours ago

I think this argument only makes sense if you believe that AGI and/or unbounded AI agents are "right around the corner". For sure, we will progress in that direction, but when and if we truly get there–who knows?

If you believe, as I do, that these things are a lot further off than some people assume, I think there's plenty of time to build a successful business solving domain-specific workflows in the meantime, and eventually adapting the product as more general technology becomes available.

Let's say 25 years ago you had the idea to build a product that can now be solved more generally with LLMs–let's say a really effective spam filter. Even knowing what you know now, would it have been right at the time to say, "Nah, don't build that business, it will eventually be solved with some new technology?"

timabdulla - 18 hours ago

I think one thing ignored here is the value of UX.

If a general AI model is a "drop-in remote worker", then UX matters not at all, of course. I would interact with such a system in the same way I would one of my colleagues and I would also give a high level of trust to such a system.

If the system still requires human supervision or works to augment a human worker's work (rather than replace it), then a specific tailored user interface can be very valuable, even if the product is mostly just a wrapper of an off-the-shelf model.

After all, many SaaS products could be built on top of a general CRM or ERP, yet we often find a vertical-focused UX has a lot to offer. You can see this in the AI space with a product like Julius.

The article seems to assume that most of the value brought by AI startups right now is adding domain-specific reliability, but I think there's plenty of room to build great experiences atop general models that will bring enduring value.

If and when we reach AGI (the drop-in remote worker referenced in the article), then I personally don't see how the vast majorities of companies - software and others - are relevant at all. That just seems like a different discussion, not one of business strategy.

NameError - 18 hours ago

I think the core problem at hand for people trying to use AI in user-facing production systems is "how can we build a reliable system on top of an unreliable (but capable) model?". I don't think that's the same problem that AI researchers are facing, so I'm not sure it's sound to use "bitter lesson" reasoning to dismiss the need for software engineering outright and replace it with "wait for better models".

The article sits on an assumption that if we just wait long enough, the unreliability of deep learning approaches to AI will just fade away and we'll have a full-on "drop-in remote worker". Is that a sound assumption?

bko - 17 hours ago

It's a little depressing how many high valued startups are basically just wrappers around LLMs that they don't own. I'd be curious to see what percentage of YC latest batch is just this.

> 70% of Y Combinator’s Winter 2024 batch are AI startups. This is compared to -57% of YC Summer 2023 companies and ~32% from the Winter batch one year ago (YC W23).

The thinking is, the models will get better which will improve our product, but in reality, like the article states, the generalized models get better so your value add diminished as there's no need to fine tune.

On the other hand the crypto fund made a killing off of "me too" block chain technology before it got hammered again. So who knows about 2-5 year term but 10 year almost certainly won't have these billion dollar companies that are wrappers around LLMs

https://x.com/natashamalpani/status/1772609994610835505?mx=2

resiros - 18 hours ago

The author discusses the problem from the point of engineering, not from business. When you look at it from business perspective, there is a big advantage of not waiting, and using whatever exists right now to solve the business problem, so that you can get traction, get funding, grab marketshare, build a team, and when the next day a better model will come, you can rewrite your code, and you would be in a much better position to leverage whatever new capabilities the new models provide; you know your users, you have the funds, you built the right UX...

The best strategy from your experience, is to jump on a problem as soon there is opportunity to solve it and generate lots of business value within the next 6 months. The trick is finding that subproblem that is worth a lot right now and could not be resolved 6 months ago. A couple of AI-sales startups "succeeded" quite well doing that (e.g. 11x), now they are in a good position to build from there (whether they will succeed in building a unicorn, that's another question, it just looks like they are in a good position now).

9dev - 18 hours ago

Well. We were working on a search engine for industry suppliers since before the whole AI hype started (even applied to YC once), and hit a brick wall at some point were it got too hard to improve search result quality algorithmically. To understand what that means: We gathered lots of data points from different sources, tried to reconcile that into unified records, then find the best match for a given sourcing case based on that. But in a lot of cases, both the data wasn’t accurate enough to identify what a supplier was actually manufacturing, and the sourcing case itself wasn’t properly defined, because users found it too hard to come up with good keywords for their search.

Then, LLMs entered the stage. Suddenly, we became able to both derive vastly better output from the data we got, and also offer our users easier ways to describe what they were looking for, find good keywords automatically, and actually deliver helpful results!

This was only possible because AI augments our product well and really provides a benefit in that niche, something that would just not have been possible otherwise. If you plan on founding a company around AI, the best advice I can give you is to choose a problem that similarly benefits from AI, but does exist without it.

leviliebvin - 16 hours ago

Controversial opinion: I don't believe in the bitter lesson. I just think that the current DNN+SGD approaches are just not that good at learning deep general expressive patterns. With less inductive bias the model memorizes a lot of scenarios and is able to emulate whatever real work scenario you are trying to make the model learn. However it fails to simulate this scenario well. So it's kind of misleading to say that it's generally better to have less inductive bias. That is only true if your model architecture and optimization approach are just a bit crap.

My second controversial point regarding AI research and startups: doing research sucks. It's risky business. You are not guaranteed success. If you make it, your competitors will be hot on your tail and you will have to keep improving all the time. I personally would rather leave the model building to someone else and focus more on building products with the available models. There are exceptions like finetuning for your specific product or training bespoke models for very specific tasks at hand.

DesaiAshu - 14 hours ago

This (particularly the figure 1 illustration) discounts the "distribution" layer for apps

Single app/feature startups will lose (true long before AI). A few will grow large enough to entrench distribution and offer a suite of services, creating defensibility against competitors

The distributors (eg. a SaaS startup that rapidly landed/expanded) will continue to find bleeding edge ways to offer a 6-12mo advantage against foundation models and incumbents

GitLab is a great example of this model. The equivalent bitter lesson of the web is that every cutting edge proprietary technology will eventually be offered free open source. However, there is a commercial advantage to purchasing the bleeding edge features with a strong SLA and customer service

The mistake is to think technology is a business. Business has always been about business. Good technology reduces the cost of sale (CAC) and cost of goods sold (COGS) to create a 85-90% margin. Good technology does not create a moat

Resilient businesses do not rely on singular technology advantages. They invest heavily in long term R&D to stay ahead of EACH wave. Resting on one's laurels after catching a single wave, or sitting out of the competition because there will be bigger waves later, are both surefire ways to lose the competition

DebtDeflation - 18 hours ago

>Eventually, you’ll just need to connect a model to a computer to solve most problems - no complex engineering required.

The word "eventually" is doing a lot of work here. Yes, it's true in the abstract, but over what time horizon? We have to build products to solve today's problems with today's technology, not wait for the generalized model that can do everything but may be decades away.

tinco - 18 hours ago

This might be true on a very long timescale, but that's not really relevant for VC's. Literally every single VC I've talked to raised the question if our moat is not just having better prompts, it's usually the first question. If a VC really invested in a company whose moat got evaporated by O1, that's on the VC. Everyone saw technology like O1 coming from a mile away.

For the slightly more complex stuff, sure at some point some general AI will probably be able to do it. But with two big caveats, the first being: when? and the second being: for how much?.

In theory every deep and wide enough neural network should be able to be trained to do object detection in images, yet no one is doing that. Technologies specifically designed to process images, like CNN's, reign supreme. Likewise for architectures of LLM's.

At some point your specialization might become obsolete, but that point might be a decade or more from now. Until then, specializations will have large economic and performance advantages making the advancements in AI today available to the industry of tomorrow.

I think it's the role of the VC to determine not if there's an AI breakthrough behind a startups technology, but if there's a market disruption and if that market disruption can be leveraged to establish a dominant company. Similar to how Google leveraged a small and easily replicable algorithmic advantage into becoming one of the most valuable companies on earth.

doctorpangloss - 18 hours ago

More computation cannot improve the quality or domain of data. Maybe the bitter lesson lesson is, lobby bitterly, for copyright laws that favor what you are doing, and weakened anti trust, to give you the insurmountable moat of exclusive data in a walled garden media network.

scosman - 17 hours ago

Great essay. This is 100% right about the technical side, but I think it misses the “product” aspect.

Building a quality product (AI or otherwise), involves design and data at all levels: UX, on-boarding, marketing, etc. The companies learning important verticals and getting in the door with customers will have a pretty huge advantage as models get better. Both in terms of install base, and knowing what customers need. Really great products don’t simply do what a customer asks, but are built by taking to a ton of customers over and over, and solving their problems better than any one of them can articulate.

It’s true we will need less and less custom software for problems. But it isn’t realistic to say the software wrapper effort is going to zero when models improve.

Plus: a lot of software effort is needed for getting the data AI needs. This is going to be a huge area - think Google maps with satellites, camera cars, network effect products (ratings), data collection (maps, traffic), etc.

2pk03 - an hour ago

Right. It boils down to federated learning on edge to have access to relevant data.

iandanforth - 16 hours ago

I disagree with the author based on timelines and VC behaviour. There is sufficient time to create a product and raise massive capital before the next massive breakthrough hands the value back to OpenAI/Google/Anthropic/MS. Secondly the execution of a solution in a vertical is sufficiently differentiating that even if the underlying problem could be solved by a next gen model there's very little reason to believe it will be. Big Cos don't have the interest to attack niches while there are billion user markets to go after. So don't build "photo share with AI", build "plant fungus photo share for farmers".

kopirgan - 18 hours ago

May be I'm being dumb but isn't the AI approach somewhat like brute force hacking of password? I mean humans don't learn that way - yes we do study code to be better coders but not in the way AI learning does?

Will someone truly discover a way to start from fundamentals and do better not just crunch zillions of petabytes faster and faster.

Or is that completely wrong? Happy to stand corrected

anonu - 3 hours ago

I've observed that every time OpenAI makes an announcement, 2 dozen startups die. The pace was quite rapid a year ago but has slowed down now. I've advised startups of this "bitter lesson". The value is in proprietary data and your ability to build a moat around domain expertise in a specific industry vertical.

theptip - 14 hours ago

I think this misses a crucial dynamic. It’s not “custom scaffolding vs. wait for better models”. There is also fine-tuning.

In specialized domains you can’t just rely on OpenAI finding all the training data required to render your experts obsolete.

If you can build a data flywheel you can fine-tune models and build that lead into a moat; whoever has the best training set will have the best product. In the short term you might start fine-tuning on OpenAI but once you’ve established your dataset you can potentially gain independence by moving onto OSS models.

If you are building a software assistant, sure, this is clearly something that OpenAI will get better at very quickly, and Altman has commented as much that many companies are building things which are certain to get steamrolled by core product development. But I think there is a very interesting strategic question around areas like law, medicine, and probably more so niche technical areas, where OpenAI will/can not absorb knowledge as quickly.

jillesvangurp - 17 hours ago

I think this is largely right. I look a this space as somebody that plugs together bits and pieces of software components for a living for a few decades. I don't need to deeply understand how each of those things work to be effective. I just need to know how to use them.

From that point of view, AI is more of the same. I've done a few things with the OpenAI apis. Easy work. There's not much to it. Scarily simple actually. With the right tools and frameworks we are talking a few lines of code mostly. The rest is just the usual window dressing you need to turn that into an app or service. And LLMs can generate a lot of that these days.

The worry for VC funded companies in this space is that a lot of stuff is becoming a commodity. For example, the llama and phi models are pretty decent. And you can run them yourself. Claude and OpenAI are a bit better and larger so you can't run them yourself. But increasingly those cheap models that you can run yourself are actually good enough for a lot of things. Model quality is a really hard to defend moat long term. Mostly the advantage is temporary. And most use cases don't actually need a best in class LLM.

So, I'm not a believer in the classic winner takes all approach here where one company turns into this trillion dollar behemoth and the rest of the industry pays the tax to that one company in perpetuity. I don't see that happening. The reality already is that the richest company in this space is selling hardware, not models. Nvidia has a nice (temporary) moat. The point of selling hardware is that you want many customers. Not just a few. And training requires more hardware than inference. So, Nvidia is rich because there are a lot of companies busy training models.

elisharobinson - 18 hours ago

this is not only not true , it has no basis's in reality. In the "real" world there are tradeoff's and constraints. scaling does not work infinitely , and products which are delivered by good engneering cultures have a non linear growth ( bad vs very good ).

comming back to research one of the frontier model's deepseek was able to come close to SOTA with a relatively small budget because of one of their mixture of experts approach.

diziet_sma - 18 hours ago

> We’ve seen this pattern in speech recognition, computer chess, and computer vision.

I think the bitter lesson is true for AI researchers, but OP overstates it's relevance to actual products. For example the best chess engines are very much still very much specific to chess. They incorporate more neural networks now, but they are still quite specific.

JohnCClarke - 18 hours ago

The smarter AI app founders know that more advanced AI will easily replace their current tech. What they're trying to do now is lock in a userbase.

moomin - 18 hours ago

The thing is, what do we do with the bitter lesson once we’re essentially ingesting the entire internet? More computation runs the risk of overfitting and there just isn’t any more data. Is the bitter lesson here telling me that we’ve basically maxed out?

danielmarkbruce - 3 hours ago

The current, intricate, post training / fine tuning going on in models like 0-1 is feature engineering.

highfrequency - 14 hours ago

One day LLMs may replace traditional search engines… in the meantime Google has built a $2 trillion business on specialized engineering and millions of human-designed features and optimizations.

The Bitter Lesson is an elegant asymptotic result. But from a business perspective it pays to distinguish problems that general deep learning approaches will disrupt in 1 year vs. 5 years vs. 30 years.

motbus3 - 16 hours ago

I agree the article presents good points specially if you consider current LLM models and the business models (and questionable business practices) being used.

It seems unlikely at this point, but it it might be the case that new methods, models and/or algorithms rise due to the trend of having high expectations/investments on this area

But yes, AI business stuff will be Swallowed by those companies as much as the companies that unwillingly are providing content to those companies.

osigurdson - 15 hours ago

I does feel like we are having the petfood.com moment in B2B AI. Bespoke solutions, bespoke offerings for very narrow B2B needs. Of course waiting around for AI to get so good that no bespoke solution is needed might be a bad strategy as well. I'm not sure how it will play out but I am certain there will be significant consolidation in the B2B agent space.

mike_hearn - 18 hours ago

Of course, 01 is a lot more expensive than smaller models. So the startups that have spent time on tuning their prompts May yet get the last laugh, at least in any competitive market where customers are price sensitive. Bear in mind that supposedly OpenAI is losing money even at a $200 a month price point so it's unclear that the current cost structure of model access is genuinely sustainable.

mlepath - 13 hours ago

The author appears to be confused about the difference between research and production. In research more generic approaches typically win because they get resourced much better (plus the ever-growing compute and data have helped, but there is no guarantee that these will continue).

On production side of "AI" (I don't love the term being thrown around this loosely as true AI should include planning, etc not just inference) the only question is how well do you solve the one problem that's in front of you. In most business usecases today that problem is narrow.

LLMs drive a minuscule (but growing) amount of value today. Recommender systems drive huge amount of value. Recommender systems are very specialized.

sd9 - 17 hours ago

I think there is value in companies built around AI. The value comes from UX, proprietary supplementary datasets, and market capture. Businesses built now will be best positioned to take advantage of future improvements in general AI. That is why I am building in the AI space. I’m not naive to the predictable improvement in foundational models.

mercurialsolo - 18 hours ago

love it when a 25 year old founder says we have been here multiple times in the past

Havoc - 17 hours ago

Conceptually that seems right, but think there is a decade+ worth of runway left on working in the "glue" space. i.e. connecting the AI and our lived realities.

I don't see the general-ness of AI overcoming the challenges there...cause there is a lot of non-obvious messiness there.

ano-ther - 18 hours ago

So what is the appropriate course of action if you are a founder? Just wait until the models inevitably get better, or help a customer with their problem now?

Computer power has been increasing all the time, but that hasn't kept people from experimenting with the limited power of their time (which ultimately led to better solutions) rather than waiting for the more powerful machines.

ryanackley - 17 hours ago

I feel like the author glosses over some nuance in the point he is trying to make and his conclusion doesn't incorporate the nuance.

The nuance is that even though AI researchers have learned this lesson, they are still building purpose built AIs because it's not possible to build an AI that can learn to do anything (i.e. AGI). Therefore, building on top of an existing AI model to meet a vertical market demand is not that crazy.

It's the same risk when building on any software platform. That is the provider of the platform may add the feature/app you are building as a free addition and your business becomes obsolete.

jvanderbot - 16 hours ago

TFA admits that specialization can gain a temporary edge. But, says using that specialization is useless because the next gen will eclipse that edge using raw CPU.

Even if it is true that the next generational change in AI is based on computational improvements, how can it be true that it's hopeless to build products by specializing this generation of tech?

Moreover if I specialize gen 1, and gen 2 is similar enough, can't specialization maintain an edge?

There seems to be a timescale mismatch.

asdev - 14 hours ago

The main point here seems to be around boxing in the AI with a rigid set of rules, and then not being able to adapt to model improvements over time. I think if those "rules" are just in the prompt, you can still adapt, but if you're starting to code a lot of rule based logic in the product, you can get into trouble.

k__ - 18 hours ago

I'm not an AI founder, but I'd say there is still some value to be added by better architectures.

These AI wrappers run ship engines on a kayak. For example, most coding companion's ignore my code base the moment I close other files. Somehow their context is minimal. Such systems can be improved dramatically just by changing the software around the LLM.

But I get it, you have to be quick and don't waste time on MVPs. If you get critical mass, you can add that later... hopefully

bhouston - 17 hours ago

In order to win you often have to start before the problem is fully solved and then count on it being solved better as you build and scale.

Thus starting with engineering effort to fit some of the AI limitations make sense but realize many of those will be temporary and will be replaced in the future.

But there is always a new tech or framework or something that emerges after you start and adopting it will improve your product measurably.

rnamerl - 13 hours ago

I think the goal is just to get as much funding from VCs and government rackets as possible.

The previous administration had Harris as the "AI czar". Which means that nothing was expected to happen.

The following administration has Sacks as the "Crypto & AI czar". I'm not aware that Sacks has any particular competence in the AI area, but he has connections. So government money is likely to flow, presumably to "defense" startups.

This All-In podcast has paid off: From peddling gut bacteria supplements and wrong explanations on how moderator rods work in nuclear power plants to major influence.

tippytippytango - 13 hours ago

The bitter lesson is a powerful heuristic, but its adherents treat it as dogma and wield it as a club to win arguments.

alganet - 17 hours ago

There is a circular argument going on in this article. Basically:

> "Flexible is better. Why? Because specific has been consistently worse"

I mean, I don't deny the historical trend. But really, why is it so? It makes sense to follow the trend, but knowing more about the reason why would be cool.

Also, I feel that the human cognitive aspects of "engineering some shit" are being ignored.

People are engineering solutions not only to be efficient, but to get to specific vantage points in which they can see further. They do it so they can see what the "next gen flexible stuff" looks like before others.

Finally, it assumes the option to scale computation is always available and ignores the diminishing returns of trying to scale vanguard technology.

The scale requirements for AI stuff are getting silly real fast due to this unshakable belief in infinite scaling. To me, they're already too silly. Maybe we need to cool down, engineer some stuff and figure out where the comfortable treshold lies.

crawshaw - 13 hours ago

The bitter lesson is a wonderful essay and well worth a read: http://www.incompleteideas.net/IncIdeas/BitterLesson.html

I also believe the original bitter lesson essay is correct. General methods on more powerful compute will supersede anything specific built today.

However I believe the lesson in this blog post is an incorrect conclusion. It is roughly analogous to "all people die, so do not bother having children because they will die one day." It is true that what you build today will be replaced in the future. That is OK. Don't get caught out when it happens. But building specific systems today is the only way to make progress with products.

Waymo started some 12 years ago as Google Chauffeur, built on all sorts of image/object recognition systems that I am sure have fallen to the bitter lesson. Should Google have refused to start a self-driving car project 12 years ago, because generalized transformers would replace a lot of their work in the future? No. You have to build products with what you've got, where you are. Adopt better tools when they come along.

keybored - 18 hours ago

> “The biggest lesson that can be read from 70 years of AI research is that general methods that leverage computation are ultimately the most effective, and by a large margin”.

> He points out that throughout AI’s history, researchers have repeatedly tried to improve systems by building in human domain knowledge. The “bitter” part of the title comes from what happens next: systems that simply use more computing power end up outperforming these carefully crafted solutions. We’ve seen this pattern in speech recognition, computer chess, and computer vision. If Sutton wrote his essay today, he’d likely add generative AI to that list. And he warns us: this pattern isn’t finished playing out.

According to Chomsky (recalled in relatively recent years for him) this is why he didn’t want to work on the AI/linguistics intersection when someone asked him in the mid 50’s. Because he thought that successful approaches would just use machine learning and have nothing to do with linguistics (that he cares about).

kubb - 18 hours ago

They will learn their lesson how exactly? By getting money for free from VCs?

Remember, most of these startups are grifters. Only few of them really believe in their products.

CaptainFever - 18 hours ago

Cool article, a lot more technical and informational than I thought from the headline.

The article gave the TL;DR as below, for those who skip to the comments:

    Historically, general approaches always win in AI.
    Founders in AI application space now repeat the mistakes AI researchers made in the past.
    Better AI models will enable general purpose AI applications. At the same time, the added value of the software around the AI model will diminish.
danielovichdk - 16 hours ago

AI founders will also make sure that users - the ones that are not capable of telling if you're lying to them by taking their money and energy - will learn the bitter lesson of taking it right up the arse. Again. Just like a repetition of history. Humanity is like ever before caught in the action of serving the individuals that made us fuck ourselves over. And the only thing you do is thanking them for it by applying to their tactics. Middle class humanity and above is morally bankrupt, and AI is the next thing they will fight over and with, but of course without adding anything positive to the world.

AlienRobot - 13 hours ago

I wish I knew what "AI" is.

Please correct me if I'm wrong, but to my understanding, every single one of these "AI" products is based on a model, i.e. it's just a normal program with a very big configuration file. The only thing "AI" about it is the machine learning performed when training the model. At the generation stage, there is nothing "AI" about it anymore. The AI is done already. What runs is a normal program.

I know it's used in computer vision, recommendation algorithms, and generative software like ChatGPT, Stable Diffusion, etc. I don't know if there is anything besides these.

From what I know, the biggest problems with AI is: 1) the program pretends to be intelligent, when it isn't, e.g. by using natural language. And 2) the program doesn't give the user enough control, they only get one text box prompt and that's it, no forms, no dialogs, so the product has to somehow generate the right answer from the most uncontrolled piece of data imaginable.

These two things combined not only limit the products potential by giving it unreasonable expectations but also makes it feel a bit of a scam: the product is a program, but what sells is its presentation: it has to present itself as being MORE than just a program by hiding as much of its internal workings from the consumers and covering its warts with additional layers of logic.

I don't know if the author would consider natural language to be a "hardcoding" create to temporarily solve a problem that should be solved by using more compute (AGI), but to me it feels like it is.

The best application of AI is probably going to be using AI internally to solve existing problems in a given domain your business is familiar with rather than try to come up with some AI solution that you have to sell other people.

EGreg - 15 hours ago

I was going to keep this to myself to maintain a competitive advantage, but I will just drop two hints:

1) Have AI turn natural language interactions into programs

2) Use test-driven development on the domain data

There is a third thing that’s far more crucial but if you want to find out what it is, contact me. I’m building a general-purpose solution for a few domains.

Unlike the other two platforms I built (Web 2.0 [1] and Web 3.0 [2]) I am not planning to open-source this project. So I don’t want to discuss more. I will however say that open source is a huge game-changer — because organizations / customers want to be able to be in control and you don’t want to be reliant on some third-party platform for an API provider. We all know how well that one turned out for startups building on Web 2.0 platforms.

1. https://github.com/Qbix

2. https://github.com/Intercoin

Devasta - 17 hours ago

This assumes that the AI bros care about building useful products. They don't for the most part, they care only about building a startup that can plausibly get VC funding. So what if in five years your LLM on the blackchain app fails, you got to blow throw a few million dollars over a few years going around to conferences and living the high life.

That's a success in anyone's book.

xchip - 14 hours ago

thanks for the tl;dr

quantumgarbage - 18 hours ago

[dead]

darepublic - 15 hours ago

Dunno why anyone in the startup space would be excited about new more powerful AI models unless they are just throwing a mask over their existential fears