All the Llamas have done it (well, 2 and 3 for sure; I believe 1 did too; I don't know about 4). I think they have a citation for it, though it might just be the RoPE paper (https://arxiv.org/abs/2104.09864).
I'm not actually aware of any model that doesn't do positional embeddings on a per-layer basis (excepting BERT and the original transformer paper, and I haven't read the GPT2 paper in a while, so I'm not sure about that one either).
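For anyone who wants to see it concretely, here's a minimal sketch of the rotary scheme from that paper (toy dimensions, PyTorch assumed, not Llama's actual implementation): the position-dependent rotation is applied to the queries and keys inside every attention layer, rather than being added once to the token embeddings at the input.

    import torch

    def rope(x: torch.Tensor, positions: torch.Tensor) -> torch.Tensor:
        # x: (seq_len, head_dim), positions: (seq_len,)
        dim = x.shape[-1]
        # Frequencies from the RoPE paper: theta_i = 10000^(-2i/dim)
        inv_freq = 1.0 / (10000 ** (torch.arange(0, dim, 2).float() / dim))
        angles = positions[:, None].float() * inv_freq[None, :]  # (seq, dim/2)
        cos, sin = angles.cos(), angles.sin()
        x1, x2 = x[..., 0::2], x[..., 1::2]  # consecutive even/odd pairs
        out = torch.empty_like(x)
        # Rotate each 2D pair by its position-dependent angle.
        out[..., 0::2] = x1 * cos - x2 * sin
        out[..., 1::2] = x1 * sin + x2 * cos
        return out

    # Called on q and k inside *each* layer's attention, e.g.:
    q = torch.randn(16, 64)
    q_rot = rope(q, torch.arange(16))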
I'm also using Claude Code and am very familiar with it, but haven't had a chance to try Qwen3 Coder 30B A3B for any real-world development. That said, it did well with my "kick the tires" tests, and some reports show that it's comparable to Sonnet (at least before adding the various levels of 'think' directives):
Judging by the @america feed on twitter it will be all of the fascism with none of the fake MAGA populism. Good luck finding a constituency for that outside of a handful of billionaires and their groupies.
I've heard from someone who knows that they're scamming people like crazy. Supposedly they also set up a bunch of LLCs to hire influencers and then never paid them.
A great feature of Pydantic is its validation hooks, which let you intercept serialization/deserialization of specific fields and augment the behavior.
For example, if you're querying a DB that returns a column as a JSON string, it's trivial with Pydantic to JSON-parse that column as part of deserialization with an annotation.
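A minimal sketch of that pattern with Pydantic v2 (the model and field names here are made up for illustration):

    import json
    from typing import Annotated, Any
    from pydantic import BaseModel, BeforeValidator

    def parse_json(value: Any) -> Any:
        # Runs before normal validation: turn a JSON string into a dict.
        return json.loads(value) if isinstance(value, str) else value

    JsonDict = Annotated[dict[str, Any], BeforeValidator(parse_json)]

    class Row(BaseModel):
        id: int
        metadata: JsonDict  # the DB hands this column back as a JSON string

    row = Row(id=1, metadata='{"source": "etl", "version": 2}')
    print(row.metadata["source"])  # etl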
Pydantic is definitely slower and not a 'zero cost abstraction', but you do get a lot for it.
Wow great timing, I just got a $22,000 bill 2 hours ago for a surgery that UHC approved 2 months ago (in a written letter from them) because they refused to pay.
I'm on the hook for $128k for a no complications birth and 5 days my newborn had to be on a CPAP machine after blue cross denied the claim. I picked the plan only after confirming all our providers were in network, but failed to check if the building where the delivery was occurring was in network.
The plan at this point is to just ignore it and hope it goes away, since they can't put it on your credit anymore.
>I picked the plan only after confirming all our providers were in network, but failed to check if the building where the delivery was occurring was in network
What?
I'm sorry, what kind of Kafkaesque system is this?!
It's the system that us Americans are tricked into believing is the best and nOt sOciAlIsM. Certainly USA healthcare is "the best" — if you can afford it!
My personal belief is that the kafkaesque nature of so many systems is designed to keep people destitute and despondent — to quote ole TedK: "our system keeps people demoralized because a demoralized person won't fight back."
~"We'll keep them poor and tired; if they're poor they can't afford to fight back, and if they're tired they won't have energy to..."~ —Jeff (Jonestown Massacre)
Having dropped out of a US medical school (almost two decades ago), I can assure you things have only gotten worse (from a bottom-80% POV). My best method of Pyrrhic victory is to not reproduce, earn just enough to live minimally (i.e. lessen my tax burden/revenue), and never pay for health insurance.
I have no idea. I tried calling the number on the bill, but it gave me a dialer with 8 options of "if you're calling about a bill from X, which is now part of Y, please dial N". When I selected 8, which was "all other", I got a canned message telling me to call between 9-5 on a weekday.
Start by calling billing and telling them what happened, and that you effectively don't have insurance and will be self-paying (said for the purpose of negotiation, not what you may or may not actually do). They should discount it by a lot.
Healthcare providers have started saying it's "insurance fraud" to say that you don't have insurance when you do.
My guess: they know they can get more money from the insurer than from the individual (or a combination of both!), so they want to scare you out of preventing them from negotiating with the insurers.
This is only semi related but I wonder what will happen to these huge hierarchical orgs when the pace of software development improves by 10-20x thanks to LLMs.
How will these risk-averse, slow-moving teams with a ton of process keep up with 100x more tiny teams of engineers who can ship whole features in days instead of months?
You don't have to worry; it's not going to happen. LLMs do/will make individuals more efficient, maybe reducing the number of developers, but you will still have the exact same bottlenecks at the exact same places throttling the delivery speed.
I'm saying there will be 10-100x more small dev shops competing with the big cos. Pizza-sized teams that own the whole product and can just ship stuff without the dog-and-pony show that's common at larger orgs.
When one bottleneck is removed, that usually means the rate of change is bottlenecked somewhere else. Maybe in the release process, or testing?
Or maybe the bottleneck is the willingness of customers to try new things? Risk-averse customers will often avoid startups. Showing yourself to be trustworthy isn’t purely about the rate of feature development.
If the other bottlenecks can’t be removed easily, instead of 10x features you could end up with fewer software developers.
Yes, for sure, but from what I've seen at large companies, the bottlenecks are already usually caused by intra-team conflicts, legal hurdles, and "processes" that take something a dev with ownership would have done in 1-2 days and turn it into months-long slogs and rituals.
Having worked at early-stage startups and mid-sized companies, there's already a 10-20x productivity gap between them due to this (even on brand-new projects at large companies vs startups, where it's not an issue of legacy code).
As an example, I just witnessed a large co hire a consulting company to help them "ideate" on a RAG app that barely worked and required 3 rewrites and ~18 months to make it to POC stage, even though a front-end dev had a better working POC that he hacked together in a day and a half.
I've heard way worse horror stories from friends at Google / Meta / Apple.
What will happen when tiny startups of 3-8 people get 5-20x more productive and can ship new stuff daily?
> What will happen when tiny startups of 3-8 people get 5-20x more productive and can ship new stuff daily?
The answer is in the comment you just wrote.
If those tiny startups are successful, they will become the next bloated large companies where things take forever because of "intra team conflicts, legal hurdles and processes", which are categories of things LLMs will never solve because LLMs can't solve problems of human consensus.
If those startups aren't successful, they will run out of money and die.
Big companies take forever to do things because they have lots of paying customers to keep happy, a bunch of people who are ready to sue them at the slightest misstep, thousands of employees with families who want job stability and therefore don't want to be betting the farm every 6 months, etc.
Tiny companies can iterate really fast because they have none of this.
LLMs don't change anything about this fundamental reality.
As the cost of going from 0 to 1 goes to 0, the incentives flip. You'll have way more small companies that raise little or no money from VCs and have no incentives to juice head count to pump the valuation.
I have a lot of friends who started similar companies recently, who are making millions in revenue with 2-8 people and deliberately plan to never grow head count past around 10 people.
We'll have way more teams like midjourney, early whatsapp / instagram and 37signals.
100% agree. The SW pipeline is complicated. AI may one day slot into every part and improve velocity, but it will be piecemeal and better at some processes than others for a long while.
I can't imagine what it's like at Meta right now, with the CEO publicly stating that they're firing the bottom 5% of performers and then a week later stating that the LLMs that his researchers / engineers are working on will soon be able to replace them.
Zuck needs Yann LeCun and other senior researchers at Meta a lot more than they need him. If they were to quit there would be a line out the door to hand them as much money as they want to start a competing open research lab. I bet a ton of top researchers from other labs would be happy to join too, since from what I've heard from friends they're all miserable from dealing with incompetent management.
On current trajectory one of Sam Altman / Zuck / Elon will end up having full control over the frontier models that are trained on their huge new clusters. All 3 of them are unaccountable to anyone.
> the CEO publicly stating that they're firing the bottom 5% of performers
I understand that people don't like any talk about layoffs and performance management, but I've never worked at a company where being in the bottom 5-10% of performers meant your job was safe. I've also never worked at a big company that didn't have at least 1-in-20 people who were clearly underperforming and everyone around them knew it.
I know the real complaint is that he said it out loud, and people don't like threats. However, Meta employees are highly compensated, especially now that the stock price is extremely high. I don't really think it's unreasonable for a company that compensates well and has generous severance packages to be cutting the bottom 5% of their workforce.
The problem isn't that they are cutting 5%, it's that they use stack ranking. Within a team of 10, you may have the top 10 performers in the whole company, but the manager still has to rank them and assign at least one of them the bottom ranking, or engage in a lengthy battle to defend their high rankings.
They're not actually finding the bottom 5%; they're giving managers an excuse to get rid of people they don't like for whatever reason.
It's also terrible for morale to do it all at once. Sure, maybe there are some underperformers. Let managers deal with those people individually. Don't do a mass layoff where they have to select someone at a specific time when all their people might be doing well.
> They're not actually finding the bottom 5%; they're giving managers an excuse to get rid of people they don't like for whatever reason.
More insidious is that the rankings are capricious and arbitrary, despite haughty claims to the contrary. Unless you're in the top quintile and know so explicitly, you can never feel safe in your position. You can also drop into the bottom quintile of the stack for no other reason than that someone else on your team self-aggrandized a bit more right before reviews.
Taking this to its eventual conclusion, wouldn't you just fire everyone?
Say you fire 5% now, then another 5%, and another, and so on. Obviously, you'll still hire, so you can argue that not everyone will be fired, but you could potentially just be firing/pushing out all the people you have today over the next X years to replace them with what you believed to be better employees. However, those newer employees are not the ones that got you to where you are today where you make so much money that you can liberally fire "the bottom 5%". It feels like a bit of a paradox.
At some point, it's worthwhile to step back and ask if maybe the system is broken. The constant hiring/culling cycle is a ruthless way to wring out performance from people who are already likely overperforming in the industry.
I'm not sure what's keeping LeCun at Meta at this point. I can imagine he's not happy with Zuck's capitulation. I'm sure you're right that if he decided to leave he'd easily be able to get funding. I'm sure France would be willing to set him up with an AI research lab to get him back there. And there would be plenty of other companies/labs that would be trying to get him.
This type of idol worship has to stop. LeCun invented the CNN, but he also said world simulation using diffusion was a dead end, which has been proven very wrong. The money is better spent hiring new grads with open minds and something to prove.
He's a director, not an "in the trenches" researcher, anymore. He's being paid to be a highly technical leader who recruits researchers and enables them to do great work, similar to Oppenheimer in a way.
In the UK it costs £12 and takes 5 minutes. It costs between £300-600 per year for an accountant to file your accounts, and £12 for your confirmation statement.
And did when we were part of the EU.
Cheaper and quicker than America.
You don't incorporate in the EU, you incorporate in one of the 27 different countries.
UK was vastly different from the mainland EU. You're right that the EU is not singular, but once we start talking of Germany, the Netherlands, France, etc. - we quickly hit regulations that bear no resemblance to a free market and some of which are incompatible with IT business whatsoever.
I suspect France/EU would be willing to set him up in a government funded research lab - possibly they already have something going that they could put him in charge of. No issues with incorporation.
Yeah he could easily get Hinton (who hates nothing more than Sam Altman) to endorse a new proper open AI lab, similar to what was described in the OpenAI Charter.
Karpathy, Alec Radford, and a ton of their old students are practically free agents right now who could probably be convinced to join.
There's probably even a chance of someone like Wojciech Zaremba leaving OpenAI to join them.
EU would build them CERN style compute clusters to train healthcare, education, climate, etc models.
I'm sure there's plenty of people at HuggingFace, Eluther, old Stability AI group who'd also love to get involved.
I've seen him say on record that he'd pretty much work for whoever pays him (in the context of research grants for the military). Virtue signaling to feel good is only worth so much to people. Humans compartmentalize very well.
It saddens me that taking an ethical stance is now derisively considered "virtue signaling".
I would never work at Meta, not because refusing to do so would make me feel good, but because working there would make me feel like I'm making the world a worse place.
The idea of having a moral compass is antagonistic to the worldview of a lot of people in tech, so they are instinctively dismissive or condescending to anyone who does.
This seems like a pretty widely shared ethos in today's software engineering culture. "I'd happily build the Torment Nexus if you pay me enough!" No ethical baseline below which we refuse to pass. Simply a required $$$:EVIL ratio.
Yea, I think this is how a lot of engineers rationalize it. "Well, I'm not directly participating in my company's A/B experiment to see what types of content drive children to suicidal ideation! I'm just moving data from the project's logging side of the stack to the metrics side of the stack so that reports can be generated. Don't blame me!"
>I'm not sure what's keeping LeCun at Meta at this point.
Maybe he's happy with his compensation, his coworkers, the food at the cafeteria and doesn't want to uproot his life or be burdened with running a company.
>I can imagine he's not happy with Zuck's capitulation
And it was already embarrassing for a myriad of reasons before that, including how he went on Joe Rogan talking about how corporations need more "masculine energy". In the hobbies I participate in (notably, I'm not in a major tech hub) some of these tech companies are getting a similar social stigma to like finance (and this is especially pronounced among women I know who really don't like what they view as "tech bros").
Indeed, I think the inauguration was kind of Zuck's "pedo guy" moment, where the pieces fell into place and a whole bunch of people at once were like... oh, yes, okay I see what is actually the state of things here.
Zucc has been kissing various unsavory rings for a long time, though. It's not like this just started. Didn't he ask China's President for the honor of naming his baby? [1] Totally shameless suck-up.
>some of these tech companies are getting a similar social stigma to like finance
SV """tech""" companies have had this stigma since at least mid-2010s. Don't you remember the awfulness of Uber's CEO?
A lot of bros in tech delude themselves that they are the "in touch" ones, and actually no, it's not chauvinism and misogyny, it's just some "masculine energy". But it's always been lies.
It really shouldn't be this surprising that the same people who swear that there's nothing wrong with tech that results in its INSANE gender ratios, despite historical evidence that women love to code, continue to ignore obvious signs of their bad behavior.
IDK, maybe it's the proximity to Hollywood and its wealth of rich chauvinists and sex predators. Maybe California has something in the water that makes rich men act like sex predators. Or maybe they are a representative sample of male behavior when in positions of power over women in the USA, and they just get outed more.
> Zuck needs Yann LeCun and other senior researchers at Meta a lot more than they need him.
Of course not. Quantifiably so. Proof: he can get all of them for salaries that are measly compared to his net worth. He has.
(P.S. Besides, you'd be surprised how replaceable such people are. At companies that can hire high-quality talent at lower levels, you often see impressive people step up when the old guard washes away, so it might actually be the opposite.)
> a week later stating that the LLMs that his researchers / engineers are working on will soon be able to replace them.
This is a pessimistic interpretation of Mark's words that has been trumpeted in the media. Which I am appalled to admit.
He said that they anticipate the majority of new code to come from AI models rather than human engineers. He then adds that they expect developers to be augmented by these tools. Which tracks as you still need somebody to drive the AI and validate or correct their outputs.
> "I think whoever gets there first is going to have a long-term, durable advantage towards building one of the most important products in history," Zuckerberg said, according to the recording.
> Zuckerberg also reiterated his belief that this would be the year Meta started seeing AI agents take on work, including writing software. Asked whether this would lead to job cuts, Zuckerberg said it was "hard to know" and that while it may lead to some roles becoming redundant, it could lead to hiring more engineers who can harness artificial intelligence to be more productive.
> What do you think will happen when these models are good enough to do 90% of engineering work?
Honestly? I think we'll see a lot of vengeful and technically capable people who are out of work and who are looking to get revenge on the people that laid them off.
Some of those people who feel they have nothing to lose will build swarms of small drones that will use machine vision to track down Zuckerberg or whoever they feel wronged them and kill them.
> you still need somebody to drive the AI and validate or correct their outputs
100% visual inspection catches only about 80% of the defects.
The following is a classic example from QC circles (I used to run incoming QC at a medical device factory). Count the number of F’s in the paragraph below:
> THE NECESSITY OF TRAINING HANDS FOR FIRST-CLASS FARMS IN THE FATHERLY HANDLING OF FRIENDLY FARM LIVESTOCK IS FOREMOST IN THE MINDS OF FARM OWNERS. SINCE THE FOREFATHERS OF THE FARM OWNERS TRAINED THE FARM HANDS FOR THE FIRST-CLASS FARMS IN THE FATHERLY HANDLING OF FARM LIVESTOCK, THE OWNERS OF THE FARMS FEEL THEY SHOULD CARRY ON WITH THE FAMILY TRADITION OF TRAINING FARM HANDS IN THE FATHERLY HANDLING OF FARM STOCK BECAUSE THEY BELIEVE IT IS THE BASIS OF GOOD FUTURE FARMING.
How many did you get?
The correct answer is thirty-four (spelled out so the number is harder to spot before you count them).
Having software devs become some sort of QC inspectors for AI code sounds like a fucking nightmare to me, and I know how much of a nightmare QC in a factory is and how many defects escape both the design and the manufacturing process even with very strict QC.
Good job, I guess. I posted that as comment-bait to get people to count it, though not with Python (does a Python one-liner count as visual inspection?). In any case, go read Deming and Juran and others in manufacturing quality, and you will still see that 100% inspection is not enough.
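(For reference, the one-liner in question is just a character count; unlike the eye, it doesn't skim past the F's in "OF":)

    text = (
        "THE NECESSITY OF TRAINING HANDS FOR FIRST-CLASS FARMS IN THE "
        "FATHERLY HANDLING OF FRIENDLY FARM LIVESTOCK IS FOREMOST IN THE "
        "MINDS OF FARM OWNERS. SINCE THE FOREFATHERS OF THE FARM OWNERS "
        "TRAINED THE FARM HANDS FOR THE FIRST-CLASS FARMS IN THE FATHERLY "
        "HANDLING OF FARM LIVESTOCK, THE OWNERS OF THE FARMS FEEL THEY "
        "SHOULD CARRY ON WITH THE FAMILY TRADITION OF TRAINING FARM HANDS "
        "IN THE FATHERLY HANDLING OF FARM STOCK BECAUSE THEY BELIEVE IT IS "
        "THE BASIS OF GOOD FUTURE FARMING."
    )
    print(text.count("F"))  # prints 34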
> He said that they anticipate the majority of new code to come from AI models rather than human engineers. He then adds that they expect developers to be augmented by these tools.
only 2 ways this can work:
1) Meta collectively generates 5x more code than it presently is capable of generating
2) Meta generates the same amount of code as it presently does, with fewer engineers, since each engineer can (supposedly) generate 5x the code
Unless Zuck announced some initiative that will require 5x more code than they currently can generate, you can be pretty sure the goal is #2.
The problem with #2 is Meta doesn't operate in a vacuum. Assuming there are problems to be solved, if Meta doesn't do #1 then someone else will. The someone else will eventually surpass Meta.
Surpass Meta in what? Meta’s revenue comes from social networks. Revenue doesn’t increase with LOC. Writing 5x more code does not get you X billion users.
No company can rest on its laurels, even one the size of Meta. No one said LOC increases users or revenue; implementing ideas does, though. If Meta decides to use the benefits of AI to keep current productivity and cut staff instead of increasing productivity, they will eventually be displaced by a group that went the other way.
I just finished a blog post with some thoughts on AI’s future [1] and the surprising conclusion was that most big tech companies probably have much bigger problems than whether researchers leave or not.
As Taleb and DeepSeek’s CEO point out, usually when you have a disruptive technology, then the incumbents will be left behind. Cursor AI and DeepSeek are a sign of new players coming out of nowhere and beating the incumbents.
Their wealth is tied up in stock whose value is tied to the perception, aka the accountability, of the general public. Not being able to personally destroy someone's wealth because you don't like what they're doing is different from being unaccountable. If tomorrow Zuck released an AI model or FB feature that was deeply unpopular, his ventures and personal wealth would dwindle according to the market's reaction. That's accountability. I'm not even a fan of Zuck... he's a slimy weasel who changes his tune to whoever is in power. But public perception directly affects his decision making.
All talk of being about social change or diversity by large companies should now be exposed as purely performative. If you want to work at not just Meta but Google or Microsoft or Amazon because money is good, that's fine. We live in a society where you need money.
But the idea that you're doing something good for society should've shattered long ago. All these big tech companies have done an immediate and total heel turn to get in line with the administration, and this isn't even a partisan issue. The interests of large companies are aligned with US domestic and foreign policy.
Meta (etc) are now no different to Boeing, Lockheed Martin or Northrop Grumman. You are working for a defense contractor.
Every day Zuck further exposes himself as being about his own class interest: that of the billionaire class. It's now OK to say that LGBTQ have "mental illness" on Meta platforms [1]. Meta already had a longstanding policy of censoring and downranking Palestine content [2].
It's also why the government was so keen to ban TikTok: because it doesn't censor.
>All talk of being about social change or diversity by large companies should now be exposed as purely performative
It was always understood as purely performative. You think gay people actually thought Target cared about them? Do you think trans people actually thought Budweiser was going to go out of its way to support the trans community just because it gave a trans person like $50k?
The only people who have ever insisted that corporate "we love the gays" was serious are the people who are yelling about how "woke" companies are. Except at the same time they will also yell about how it's just performative?
I can't help but feel what they were asking for was never genuine support of LGBTQ people either, since, uh, who they tend to vote for. Rather, their complaint seems to have come simply from any media, any images, any acknowledgement whatsoever that LGBTQ people are PEOPLE
It's weird. You either stay quiet, or be loud and expect to be out of a job. The mindset is "will this help for PSC" (Meta's performance review cycle).
I'm not bothered by the free-speech policy decisions or the Trump political contributions. Especially in light of overreach by the Biden administration, allowing more speech is reasonable, and political contributions to the party in power are always reasonable.
What bothers me is dishonesty from leadership about cost cutting, refusing to answer hard questions at the Q&A, and short-sighted decisions causing a lot of churn. When Sheryl left, the adult in the room that would call out Zuck left. No one's there to tell Zuck that the gold chain and million dollar watch isn't a good look. And now Nick Clegg left and Dana White joined the board. I'm sure his UFC experience will prove indispensable.
Don't get me started on how much money is wasted on AR/VR.
If it weren't for juicy 2023 RSUs and the bad job market, there'd be a lot more turnover.
The only reliable final test will be a black-box test suite that takes your model, executes it in a sealed environment, and gives you a grade back, potentially with a performance breakdown by subject.
No telling companies what the questions look like, what the output format is, what topics are covered, so that there’s no room to make up synthetic data to interpolate from.
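To make the idea concrete, here's a hypothetical sketch of the interface such a harness might expose (all names invented; only aggregate grades ever leave the sandbox):

    import random
    from typing import Callable

    # Hypothetical sealed suite: tasks are never published and get rotated,
    # so there is nothing for vendors to interpolate synthetic data from.
    HIDDEN_TASKS = [
        {"subject": "arithmetic", "prompt": "What is 17 * 23?",
         "check": lambda out: "391" in out},
        {"subject": "arithmetic", "prompt": "What is 12 + 30?",
         "check": lambda out: "42" in out},
    ]

    def grade(model: Callable[[str], str]) -> dict[str, float]:
        # Run every hidden task in random order; report only per-subject averages.
        scores: dict[str, list[float]] = {}
        for task in random.sample(HIDDEN_TASKS, len(HIDDEN_TASKS)):
            passed = task["check"](model(task["prompt"]))
            scores.setdefault(task["subject"], []).append(float(passed))
        return {subject: sum(s) / len(s) for subject, s in scores.items()}

    # e.g. grade(lambda prompt: my_model.generate(prompt))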
A grade is mostly meaningless if you don't know how it was calculated, so no one would "rely" on it. If nothing else, you need to know the grading methodology after the test.
It's the same problem with cheating students. Once the test questions are known, they have a very short lifespan before cheaters can make them worthless. Tests have to be refreshed.
If I don't know what the tasks were, that's almost exactly as useless to me as a unitless number would be. For starters, are they all of equal difficulty? Are you sure? Do you expect to be able to convince me of that without letting me see them?
Except we’re probably decades away from reliable open-ended agents that can be trusted to perform any task.
There’s a reason why Waymo started out in SF and Phoenix: getting to enough 9s to be hands-off is really hard, and current ML-based systems don’t extrapolate well to new environments.
That's certainly possible. I'm not convinced AGI is just around the corner either, but I can't say with a high degree of certainty that it definitely won't arrive in the next few years.
We’ll definitely get above-human-level performance on a lot of tasks soon. It just won’t be general and reliable enough to do open-ended tasks the way competent humans do.
So we’ll have models that can fill out and validate a tax return, and give you reasonable financial advice, but we won’t have an off-the-shelf general LLM from OpenAI that can replace an accountant at any random business anytime soon.