Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Yeah. Lots of busywork where if you had to assign it to a human you would need to find someone with deep technical expertise plus inordinate, unflagging attention to detail. You couldn’t pass it off to a batch of summer interns. It would have needed to be done by an engineer with some real experience. And there is no way in the world you could hire enough to do it, for almost any money.


You've missed the subtlety here.

LLMs don't have attention to detail.

This project had extremely comprehensive, easily verifiable, tests.

So the LLM could be as sloppy as they usually arez they just had to keep redoing their work until the code actually worked.


I missed the subtlety?

I linked the paper! I read the paper. Yeah. they wrote the tests, which is how this worked! how the heck do you think it was supposed to work?

the fact that they needed to write the tests was just the means to implementation. It didn't change the non-LLM labor economics of the problem.


No, I meant subtlety of definition, you've attributed the diligence to the LLM when in fact it's the tests that provide that.

You've unfortunately committed the big sin of anthropomorphizing the LLM and calling it diligent.

An LLM cannot be diligant, it's stochastic so it's literally impossible for it to be diligant.

Writing all those tests was diligant.


I didn’t attribute diligence to anything.

I’m not worried about the personal character of diligence. I’m interested in what the technology unlocked and how things made with it are materially different in terms of labor configurations.


Lmfao I’ve committed the sin of anthropomorphism.

Fadda forgive me!

C’mon pal.


Who wrote the tests?


The meat wrote the tests. As I've been telling you, they're made out of meat.


And how does the answer to your question bear on the claim I’m making?


If you're trying to automate all coding activity, writing tests is coding activity. Arguably the greater fraction of effort between implementation, and verifying said implementation. If the only thing making your problem space tractable for the automation to be able to replace the lesser half of coding activity is an authored test suite you couldn't generate via your automation, then you really need to admit that.

"Did you check?" is the most expensive question, and one of the most feared in my experience in tech circles. Spent quite a few years as a dedicated tester once I developed the knack for it. Everybody gangsta til it's time to prove the damn thing works.


> "Did you check?" is the most expensive question, and one of the most feared in my experience in tech circles.

Here it’s a climb-down. Writing tests to validate translation is orders of magnitude less work (and less likely to fail or be too dull to do properly) than the alternative available prior to 2025.

The fact that they wrote tests and clearly established operational control is not some kinda gotcha! It’s how they managed to get this piece of technology to allow them to do the IMPOSSIBLE.

I am just really struggling to understand someone who reads that paper and thinks “yup, everything is still the same and we don’t need to re-evaluate any ideas” Like, if you want to say that they still need engineering discipline in order to do ISA transitions at scale then…ok? That’s true? But Gemini meant that this formerly impossible thing was now not only within reach but done.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: