Leaving Meta and PyTorch

soumith.ch

717 points by saikatsg 4 days ago


cs702 - 3 days ago

Many of the comments here are judging PyTorch in hindsight, which is unfair.

When Soumith Chintala co-launched PyTorch, and for many years after, the alternatives for fast, interactive, convenient development were much worse. There was no Jax.

Every single AI researcher I know, including me, who tried PyTorch back then immediately wanted to switch to it, because it was so much better. Andrej Karpathy described what PyTorch felt like back then when he tweeted, on May 2017, "I've been using PyTorch a few months now and I've never felt better. I have more energy. My skin is clearer. My eyesight has improved."[a]

THANK YOU SOUMITH for your hard work over all these years! Your hard work has made a difference for a huge number of people, including many of us here on HN.

We wish you success in your future endeavors, whatever they turn out to be!

Please ignore all the petty criticism.

---

[a] https://x.com/karpathy/status/868178954032513024

jeffreysmith - 3 days ago

I'm one of the many people who Soumith hired to Meta and PyTorch. I had the privilege of working on PyTorch with him and lots of the folks on this post.

As his longtime colleague, the one thing I would want people to know about him and this decision is that Soumith has always viewed PyTorch as a community project. He consistently celebrated the contributions of his co-creators Adam and Sam, and he extended the same view towards the Yangqing and the Caffe2 crew that we merged into PyTorch. At the very beginning, by Soumith's highly intentional design, PyTorch was aimed at being truly developed by and for the AI research community and for many years that was the key way in which we grew the framework, FB PT team, and the wider community. At every single stage of PT's lifecycle, he always ensured that our conception of PT and its community grew to include and celebrate the new people and organizations growing what was possible with PT. He's an incredible talent magnet, and thus more and more smart people kept dedicating their blood, sweat, and tears to making PT bigger and better for more people.

I've worked with some very well known and highly compensated leaders in tech, but *no one* has done the job he has done with ameliorating a bus factor problem with his baby. PT has a unique level of broad support that few other open source technology can reach. In a world of unbounded AI salaries, people who want to move AI research methods forward still freely give their time and attention to PyTorch and its ecosystem. It's the great lever of this era of AI that is moving the world, *due in large part* to the strength of the community he fostered and can now let continue without his direct involvement.

His departure is the end of an era, but it's also operationally a true non-event. PyTorch is going strong and can afford to let one of its creators retire from stewardship. This is precisely what success looks like in open source software.

He deserves our congratulations and our thanks. Enjoy your PT retirement, man.

utopiah - 4 days ago

What I find most interesting with this is that it shows they believe there is nothing unique at Meta related to AI. There is no resource, people and computing power, that they can't get elsewhere for whatever they believe would be more interesting for them.

I mention this because it feels analogous to military research, where people "dream" of how advanced the military is, how forward they are compared to public research... and yet, it seems to be a recurring myth they love to sustain.

So the signal I get here is AI "labs" in BigTech have nothing worth waiting for around the corner, it's just more of the same and boring for people who stick there.

vintermann - 4 days ago

That man has an infective enthusiasm. I remember the DCGAN paper inspired me to try getting the (Lua) Torch code to work, and I tried it on the Oxford flowers dataset early on. It worked surprisingly well, and Soumith Chintala even shared it around in social media, surprised at how well it worked on such a small dataset. Of course back then we didn't really appreciate the problem of mode collapse.

Pytorch and old Lua Torch were a pleasure to work with compared to the contemporary Tensorflow. Lots of S.C's code was copied around liberally, it had its quirks (I remember the DCGAN code had a pretty odd way of doing parameter passing) but it was also really easy to understand and made random people like me feel like we had suddenly stumbled onto something crazy powerful (which we had!). It was wonderfully hackable.

qmatch - 4 days ago

As a loyal JAX user, I hope they can play catchup. PyTorch has dominated the AI scene since TF1 fumbled the ball at 10th yard line. What Matt Johnson has done turning Autograd into JAX is hopefully going to be worthy of as much praise as what Soumith has received.

mmaunder - 3 days ago

“ What's next for me? Something small. Something new. Something I don't fully understand yet. Something uncomfortable. I could have moved to something else inside Meta. But I needed to know what's out there. I needed to do something small again. I couldn't live with the counterfactual regret of never trying something outside Meta.”

Shades of Siddhartha. Back to the forest.

aabhay - 4 days ago

For anyone that’s curious, the underlying Torch library is also a joy to work with, as are the many other torch bindings. For example, Rust has tch and Burn which both work with libtorch.

PyTorch of course has the benefit of being dynamically debuggable. Can’t forget the first time I break pointed my pytorch model and wrote pytorch calls inside the terminal to inspect the behavior. That’s still something I miss a lot now that I’m working with only “fast” compiled code.

chopete3 - 4 days ago

>>Every major AI company and hardware vendor are on a speed dial. This kind of power is really hard to give up. But curiosity ultimately won out in my head.

A simple feeling has such a power. May he gets an opportunity to create one more powerful tool before retiring.

mxkopy - 4 days ago

PyTorch is one of those tools that’s so simple and easy to take apart that you feel like you might’ve been able to make it yourself. I can’t imagine how much engineering effort was behind all those moments where I thought to myself, “of course it should work like that, how can it be any other way?”

odyssey7 - 3 days ago

> It's taught in classrooms from MIT to rural India. The tools I dreamed about making accessible? They are. The barrier to entry I wanted to lower? It's almost gone.

I have an ironic sense that there are classrooms in rural India with better pedagogy and lower barriers to entry than some of our elite engineering programs.

jmward01 - 3 days ago

All I can say is 'thanks!'. It does take a team, and a community, but individuals matter. I use pytorch daily, it has made it possible for me to play with ideas that I would have only dreamed of. It is a big part of my life so, thanks for your contributions and best of luck on the next thing!

ergocoder - 4 days ago

I wonder how much this guy has earned from Meta in total. Would it reach $100M?

xpe - 3 days ago

It is notable (but perhaps not surprising) that this is mostly about the people and the work itself. The writing is silent on the downstream impacts on the world. In contrast, there are fields (global development, medicine, etc.) where people tend to focus on the impact on humanity (especially when reaching a milestone in their career).

sumedh - 4 days ago

His homepage says he wants to build a robot. So he is probably going to work with robots for his next role.

He is an investor in Anthropic, didnt know you could do that working for Meta.

gdiamos - 4 days ago

This is the end of an era. Amazing work soumith.

shevy-java - 3 days ago

To me it sounds as if he is trying to open a new chapter in his life. Good for him, but I wonder if everything was really as smooth as described. People often write how everything is always perfect on their blog. Well - could be. But it could also be that not everything was perfect but no description followed on the blog.

sota_pop - 3 days ago

I started my deep learning journey circa 2018 hand crafting autoencoders with Matlab cells. Pretty quickly moving on to Python, I remember reading about PyTorch and TensorFlow. Back then, TF lacked dynamic graphs which made PT way more favorable. Even still, many compared PT to Keras. However I always thought a closer comparison to Keras was something like Fast.ai from Jeremy Howard et. al., which I really enjoyed and don’t see many people use.

In any case, I ended up sticking with PT and am extremely grateful for all the work put into it. Thank you.

irthomasthomas - 4 days ago

Counterfactual Regret Minimization irl

ninjagoo - 3 days ago

Soumith's 2nd release? https://github.com/pytorch/pytorch/releases/tag/v0.1.1

Also, looking at the contribution history for a long career is very interesting; reflects the changing roles over time https://github.com/soumith

philipwhiuk - 4 days ago

There's no context around 'FAIR' - is it https://www.go-fair.org/?

livelaughlove69 - 3 days ago

What an amazing impact this man has had. He seems like a great guy too. I don't know him at all personally but his replies online have always been so nice. I really wish him all the best with whatever he does with his time next

numice - 4 days ago

I read one post on his blog and found that Adam Paszke reached out to the author and got an internship. I wonder if it was that easy to get an internship at FAIR. I thought that they hire only PhDs.

warbaker - 3 days ago

I just want to say: thank you. It is hard to overstate how much Pytorch has accelerated ML/AI development, across the board.

jsrozner - 4 days ago

Is this also partially AI generated? What's with the repeated short phrases? Is this just everyone's style now?

kleiba - 4 days ago

You forgot to thank Jürgen. /scnr

- 4 days ago
[deleted]
perfmode - 4 days ago

Respect.

msmd74 - 4 days ago

Sounds like you had a momentous run.

If you take advice from reformed Internet trolls, consider turning off all your devices and trying to give yourself at least a week, but ideally a month offline staring at your new baby. You'll never get that time back and there's nothing your brain will appreciate more than loading up those memories as they grow.

Good luck.

- 3 days ago
[deleted]
isusmelj - 4 days ago

Very proud as a Swiss that Soumith has a .ch domain!

mupuff1234 - 3 days ago

https://news.ycombinator.com/item?id=45847465

Seems relevant.

w10-1 - 3 days ago

Given the technical achievements and industry impact, I'm struck that his emphasis is on the people, and in particular growing people to take on challenges. His influence might lie as much in what they do. The connection linking them seems to be building curiosity and wonder into delight for users.

If there's a soul to silicon valley, that's it. However many jerks and political/power players I encounter, I remain inspired by the few like this.

hshdhdhehd - 4 days ago

Nice, that is the dream career!

BoredPositron - 4 days ago

The last few years must have been incredibly exhausting. Thanks for your work good luck and 73.

RobRivera - 3 days ago

Good luck, good job, and may more fantastical journeys lay ahead.

CommenterPerson - 3 days ago

Firstly, Good work.

Ironical but one HN front page item today is this: "Meta projected 10% of 2024 revenue came from scams and banned goods, Reuters reports"

Glad you're leaving, hopefully you're in a good place financially. Take a page from Bill Gates and work on something that attempts to improve society. Stay away from surveillance capitalism and enshittification.

sherinjosephroy - 4 days ago

[dead]

Glamklo - 4 days ago

[dead]

mugivarra69 - 3 days ago

[dead]

deaux - 3 days ago

[flagged]

zkmon - 4 days ago

[flagged]

theusus - 4 days ago

[flagged]

jurschreuder - 3 days ago

It's just a python wrapper around Torch, an open source C program by a Swiss non-profit, so META could compete with TensorFlow of Google.

I don't know why this is celebrated so much, a big company rebranding a non-profit for profit.

But I guess that's the norm for AI now.

ishouldbework - 4 days ago

Look, I get that some pages require javascript, but

    <style class="fallback">body{visibility:hidden;white-space:pre;font-family:monospace}</style>
which is then unset by JS, with no <noscript> anywhere, is just... I just get white page.

Changing it to

    <style class="fallback">body{white-space:pre-wrap;font-family:monospace}</style>
gives perfectly readable web, so it seem bit... pointless.