X and coinbase on this list lmfaou what a joke
Your boi Ric, heir to the Big Muffin 69 family fortune
Recently left my job as a freelance paper boi to pursue my BFA (degree in Big Foot Alignment). Currently imagining positive futures with Big Foot governance in a post-Big Foot society.
- 0 Posts
- 47 Comments
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: weekly thread for sneers not worth an entire post, week ending 14th September 2025English8·16 days agoHecate left no crumbs
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: weekly thread for sneers not worth an entire post, week ending 14th September 2025English8·19 days agoUntil proven otherwise, I assume everyone I encounter is a fellow sneerer (derogatory)
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: weekly thread for sneers not worth an entire post, week ending 7th September 2025English12·25 days agoGreat piece on previous hype waves by P. Ball
https://aeon.co/essays/no-suffering-no-death-no-limits-the-nanobots-pipe-dream
It’s sad, my “thoroughly researched” “paper” greygoo-2027 just doesn’t seem to have that viral x-factor that lands me exclusive interviews w/ the Times 🫠
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: weekly thread for sneers not worth an entire post, week ending 31st August 2025 - awful.systemsEnglish8·29 days agohttps://www.argmin.net/p/the-banal-evil-of-ai-safety
Once again shilling another great Ben Recht post. This time calling out the fucking insane irresponsibility of “responsible” AI providers to do the bare minimum to prevent people from having psychological beaks from reality.
"I’ve been stuck on this tragic story in the New York Times about Adam Raine, a 16-year-old who took his life after months of getting advice on suicide from ChatGPT. Our relationship with technological tools is complex. That people draw emotional connections to chatbots isn’t new (I see you, Joseph Weizenbaum). Why young people commit suicide is multifactorial. We’ll see whether a court will find OpenAI liable for wrongful death.
But I’m not a court of law. And OpenAI is not only responsible, but everyone who works there should be ashamed of themselves."
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: weekly thread for sneers not worth an entire post, week ending 31st August 2025 - awful.systemsEnglish3·1 month agoGotta be trolling.
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: weekly thread for sneers not worth an entire post, week ending 24th August 2025English9·1 month agoThe implication that Soares / MIRI were doing serious research before is frankly journalist malpractice. Matteo Wong can go pound sand.
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: weekly thread for sneers not worth an entire post, week ending 24th August 2025English8·1 month agoChat wtf is this curve?
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: weekly thread for sneers not worth an entire post, week ending 24th August 2025English15·1 month agoGary asks the doomers, are you “feeling the agi” now kids?
To which Daniel K, our favorite guru lets us know that he has officially
moved his goal postsupdated his timeline so now the robogod doesnt wipe us out until the year of our lorde 2029.It takes a big brain superforecaster to have to admit your four month old rapture prophecy was already off by at least 2 years omegalul
Also, love: updating towards my teammate (lmaou) who cowrote the manifesto but is now saying he never believed it. “The forecasts that don’t come true were just pranks bro, check my manifold score bro, im def capable of future sight, trust”
Reenforcement learning
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: Stubsack: weekly thread for sneers not worth an entire post, week ending 10th August 2025English12·2 months agoOnly taste tester I trust dropped his verdict
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: Stubsack: weekly thread for sneers not worth an entire post, week ending 10th August 2025English9·2 months agoThe one big cope I’m seeing is in the METR graph ofc. Tiny bump with massive error bars above Grok 4 so they can claim the exponential is continuing while the models stagnate in all material ways.
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: Stubsack: weekly thread for sneers not worth an entire post, week ending 10th August 2025English15·2 months agoYeah, O3 (the model that was RL’d to a crisp and hallucinated like crazy) was very strong on math coding benchmarks. GPT5 (I guess without tools/extra compute?) is worse. Nevertheless…
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: Stubsack: weekly thread for sneers not worth an entire post, week ending 10th August 2025English13·2 months agoWell, after 2.5 years and hundreds of billions of dollars burned, we finally have GPT-5. Kind of feels like a make or break moment for the good folks at OAI~~! With the eyes of the world on their lil presentation this morning, everyone could feel the stakes: they needed something that would blow our minds. We finally get to see what a super intelligence looks like! Show us your best cherry picked benchmark Sloppenheimer!
Graphic design is my PASSION. Good thing the entirety of the world’s economy is not being held up by cranking out a few more points on SWE bench right???
Ok. what about ARC? Surely ya’ll got a new high to prove the AGI mission was progressing right??
Oh my fucking God. They actually have lost the lead to fucking Grok. For my sanity I didn’t watch the live stream, but curiously, they left the ARC results out of their presentation. Even though they gave Francois access early to test. Kind of like they knew this looks really bad and underwhelming.
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: Stubsack: weekly thread for sneers not worth an entire post, week ending 10th August 2025English7·2 months agocheers m8, ill drink to that
BigMuffN69@awful.systemsto TechTakes@awful.systems•Stubsack: Stubsack: weekly thread for sneers not worth an entire post, week ending 10th August 2025English6·2 months agoI’m ignorant- give me the lore drop.
Nice result, not too shocking after IMO performance. A friend of mind told me that this particular competition is highly time constrained for human competitors, i.e., questions aren’t impossibly difficult per se, but some are time sinks that you simply avoid to get points elsewhere. (5 hours on 12 Qs is tight…)
So when you are competing against a data center using a nuclear reactor vs 3 humans running on broccoli, the claims of superhuman performance definitely require an * attached to them.