Anthropic and Apollo astounded to find that a chatbot will lie to you if you tell it to lie to you
  • I mean, you could have answered by naming one fabled new ability LLMs suddenly 'gained' instead of being a smarmy tadpole, but you didn't.

  • Anthropic and Apollo astounded to find that a chatbot will lie to you if you tell it to lie to you
  • Slate Scott just wrote about a billion words of extra rigorous prompt-anthropomorphizing fanfiction on the subject of the paper; he called the article When Claude Fights Back.

    Can't help but wonder if he's just a critihype-enabling useful idiot who refuses to know better, or if he's being purposefully dishonest to proselytize people into his brand of AI doomerism and EA, or if the difference is meaningful.

    edit: The Claude syllogistic scratchpad also makes an appearance. It's that thing where we pretend they have a module that gives you access to the LLM's inner monologue, complete with privacy settings, instead of just recording the result of someone prompting a variation of "So what were you thinking when you wrote so-and-so, remember no one can read what you reply here". Cue a bunch of people in the comments moving straight into wondering if Claude has qualia.

  • Featured
    Stubsack: weekly thread for sneers not worth an entire post, week ending 23rd December 2024
  • Rationalist debatelord org Rootclaim, who in early 2024 lost a $100K bet by failing to defend covid lab leak theory against a random ACX commenter, will now debate millionaire covid vaccine truther Steve Kirsch on whether covid vaccines killed more people than they saved; the loser gives up $1M.

    One would assume this to be a slam dunk, but then again one would assume the people who founded an entire organization about establishing ground truths via rationalist debate would actually be good at rationally debating.

  • "Sam Altman is one of the dullest, most incurious and least creative people to walk this earth."
  • It's useful insofar as you can accommodate its fundamental flaw of randomly making stuff the fuck up, say by having a qualified expert constantly combing its output instead of doing original work, and don't mind putting your name on low quality derivative slop in the first place.

  • Casey Newton drinks the kool-aid
  • And all that stuff just turned out to be true

    Literally what stuff, that AI would get somewhat better as technology progresses?

    I seem to remember Yud specifically wasn't that impressed with machine learning and thought so-called AGI would come about through ELIZA type AIs.

  • Apple Intelligence AI mangles headlines so badly the BBC officially complains
  • In every RAG guide I've seen, the suggested system prompts always tended to include some more dignified variation of "Please for the love of god only and exclusively use the contents of the retrieved text to answer the user's question, I am literally on my knees begging you."

    Also, if reddit is any indication, a lot of people actually think that's all it takes and that the hallucination stuff is just people using LLMs wrong. I mean, it would be insane to pour so much money into something so obviously fundamentally flawed, right?
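    For illustration, a minimal sketch of the kind of prompt assembly those guides describe; the wording and the helper function here are my own invention, not taken from any particular guide:

    ```python
    # Hypothetical sketch of a typical RAG prompt. The exact wording varies by guide,
    # and nothing in it actually *enforces* the constraint.
    def build_rag_messages(question: str, retrieved_chunks: list[str]) -> list[dict]:
        context = "\n\n".join(retrieved_chunks)
        system = (
            "Answer ONLY from the context provided below. "
            "If the answer is not in the context, say you don't know. "
            "Do not use outside knowledge."
        )
        user = f"Context:\n{context}\n\nQuestion: {question}"
        return [
            {"role": "system", "content": system},
            {"role": "user", "content": user},
        ]
    ```

    The begging is the whole mechanism: the instruction is just more text in the context window, so whether the model actually sticks to the retrieved passages is up to the same statistical process that hallucinates in the first place.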

  • SF tech startup Scale AI, worth $13.8B, accused of widespread wage theft
  • If you never come up with a marketable product you can remain a startup indefinitely.

  • NSFW
    EAs at Vox malding at the latest NY Times hit piece.
  • Getting Trump reelected should count as 'billionaire philanthropy'.

  • TPOT hits the big time!
  • thinkers like computer scientist Eliezer Yudkowsky

    That's gotta sting a bit.

  • Sorry kid, we killed your robot — Embodied goes broke, AI social devices will stop working
  • promise me you’ll remember me

    I'm partial to Avenge me! as last words myself.

  • That Time Eliezer Yudkowsky recommended a really creepy sci-fi book to his audience
  • Nabokov's Lolita really shouldn't be pigeonholed as merely that, but I guess the movies are another story.

  • That Time Eliezer Yudkowsky recommended a really creepy sci-fi book to his audience
  • Dolores in Lolita was like twelve though, at least in the book.

    edit: also I don't think Yud recommending The Softcore Adventures Of A Six-year-old In A Thirteen-year-old's Body as a Very Normal Book to his considerable audience fits this particular discourse.

  • That Time Eliezer Yudkowsky recommended a really creepy sci-fi book to his audience
  • In case anybody skips the article, it's about a six-year-old cybernetically force-grown into the body of a horny 13-to-14-year-old.

    The rare sentence that makes me want to take a shower for having written it.

  • LessWrong House Style (2023)
  • ...with a huge chip on his shoulder about how the system caters primarily to normies instead of specifically to him, thinks he has fat-no-matter-what genes and is really into rape play.

  • Stubsack: weekly thread for sneers not worth an entire post, week ending 1st December 2024
  • The old place on reddit has a tweet up by aella where she goes on a small evo-psych tirade about how, since there's been an enormous amount of raid-related kidnapping and rape in prehistory, it stands to reason that women who enjoyed that sort of thing had an evolutionary advantage, and so that's why most women today... eugh.

    I wonder where the superforecasters stand on aella being outed as a Ghislaine Maxwell-type fixer for the TESCREAL high priesthood.

  • LessWrong House Style (2023)
  • There's also the communal living, the workplace polyamory along with the prominence of the consensual non-consensual kink, the tithing of the bulk of your earnings and the extreme goals-justify-the-means moralising, the emphasis on psychedelics and prescription amphetamines, and so on and so forth.

    Meaning, while calling them a cult incubator is actually really insightful and well put, I have a feeling that the closer you get to TESCREAL epicenters like the SFB the more explicitly culty things start to get.

  • LessWrong House Style (2023)
  • EA started as an offshoot of LessWrong, and LW-style rationalism is still the main gateway into EA as it's pushed relentlessly in those circles, and EA contributes vast amounts of money back into LW goals. Air strikes against datacenters guy is basically bankrolled by Effective Altruism and is also the reason EA considers magic AIs (so called Artificial Super Intelligences) by far the most important risk to humanity's existence; they consider climate change mostly survivable and thus of far less importance, for instance.

    Needless to say, LLM peddlers loved that (when they aren't already LW/EAs or adjacent themselves, like the previous OpenAI administrative board before Altman and Microsoft took over). edit: also the founders of Anthropic.

    Basically you can't discuss one without referencing the other.

  • LessWrong House Style (2023)
  • It's complicated.

    It's basically a forum created to venerate the works and ideas of that guy who, in the first wave of LLM hype, had an editorial published in TIME where he called for a worldwide moratorium on AI research and GPU sales, to be enforced with unilateral airstrikes, and whose core audience got there by being groomed by one of the most obnoxious Harry Potter fanfictions ever written, by said guy.

    Their function these days tends to be to provide an ideological backbone of bad scifi justifications for deregulation and the billionaire takeover of the state, which among other things has made them hugely influential in the AI space.

    They are also communicating vessels with Effective Altruism.

    If this piques your interest, check the links in the sidebar.

  • New article from reflective altruism guy starring Scott Alexander and the Biodiversity Brigade
    reflectivealtruism.com Human biodiversity (Part 4: Astral Codex Ten) - Reflective altruism

    This post discusses the influence of human biodiversity theory on Astral Codex Ten and other work by Scott Alexander.

    Would've been way better if the author didn't feel the need to occasionally hand it to siskind for what amounts to keeping the mask on, even while he notes several instances where scotty openly discusses how maintaining a respectable facade is integral to his agenda of infecting polite society with neoreactionary fuckery.

    It can't be that the bullshit machine doesn't know 2023 from 2024, you must be organizing your data wrong (wsj)

    > AI Work Assistants Need a Lot of Handholding

    > Getting full value out of AI workplace assistants is turning out to require a heavy lift from enterprises. ‘It has been more work than anticipated,’ says one CIO.

    aka we are currently in the process of realizing we are paying for the privilege of being the first to test an incomplete product.

    > Mandell said if she asks a question related to 2024 data, the AI tool might deliver an answer based on 2023 data. At Cargill, an AI tool failed to correctly answer a straightforward question about who is on the company’s executive team, the agricultural giant said. At Eli Lilly, a tool gave incorrect answers to questions about expense policies, said Diogo Rau, the pharmaceutical firm’s chief information and digital officer.

    I mean, imagine all the non-obvious stuff it must be getting wrong at the same time.

    > He said the company is regularly updating and refining its data to ensure accurate results from AI tools accessing it. That process includes the organization’s data engineers validating and cleaning up incoming data, and curating it into a “golden record,” with no contradictory or duplicate information.

    Please stop feeding the thing too much information, you're making it confused.

    > Some of the challenges with Copilot are related to the complicated art of prompting, Spataro said. Users might not understand how much context they actually need to give Copilot to get the right answer, he said, but he added that Copilot itself could also get better at asking for more context when it needs it.

    Yeah, exactly like all the tech demos showed -- wait a minute!

    > [Google Cloud Chief Evangelist Richard Seroter said] “If you don’t have your data house in order, AI is going to be less valuable than it would be if it was,” he said. “You can’t just buy six units of AI and then magically change your business.”

    Nevermind that that's exactly how we've been marketing it.

    Oh well, I guess you'll just have to wait for chatgpt-6.66 that will surely fix everything, while voiced by charlize theron's non-union equivalent.

    Generating (often non-con) porn is the new crypto mining

    An AI company has been generating porn with gamers' idle GPU time in exchange for Fortnite skins and Roblox gift cards

    > "some workloads may generate images, text or video of a mature nature", and that any adult content generated is wiped from a users system as soon as the workload is completed.

    > However, one of Salad's clients is CivitAi, a platform for sharing AI generated images which has previously been investigated by 404 media. It found that the service hosts image generating AI models of specific people, whose image can then be combined with pornographic AI models to generate non-consensual sexual images.

    Investigation link: https://www.404media.co/inside-the-ai-porn-marketplace-where-everything-and-everyone-is-for-sale/

    SBF's effective altruism and rationalism considered an aggravating circumstance in sentencing
    www.citationneeded.news Sam Bankman-Fried wants only six years for his "victimless" crime

    Sam Bankman-Fried maintains that his crimes were victimless and resulted in zero losses, and therefore warrant only six years of imprisonment. Prosecutors argue that 40–50 years are justified.


    For Thursday's sentencing the US government indicated they would be happy with a 40-50 year prison sentence, and in the list of reasons they cite there's this gem:

    > 4. Bankman-Fried's effective altruism and own statements about risk suggest he would be likely to commit another fraud if he determined it had high enough "expected value". They point to Caroline Ellison's testimony in which she said that Bankman-Fried had expressed to her that he would "be happy to flip a coin, if it came up tails and the world was destroyed, as long as if it came up heads the world would be like more than twice as good". They also point to Bankman-Fried's "own 'calculations'" described in his sentencing memo, in which he says his life now has negative expected value. "Such a calculus will inevitably lead him to trying again," they write.
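    Spelled out, the coin-flip "calculus" is just a naive expected-value comparison (my notation, not the filing's): if the world is currently worth V and the heads outcome is worth more than 2V, then

    $$\mathbb{E}[\text{flip}] > \tfrac{1}{2}\cdot 0 + \tfrac{1}{2}\cdot 2V = V = \mathbb{E}[\text{don't flip}],$$

    so the framework says take the gamble every time, however catastrophic the tails branch is.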

    Turns out making it a point of pride that you have the morality of an anime villain does not endear you to prosecutors, who knew.

    Bonus: SBF's lawyers' list of assertions for asking for a shorter sentence includes this hilarious bit of reasoning:

    > They argue that Bankman-Fried would not reoffend, for reasons including that "he would sooner suffer than bring disrepute to any philanthropic movement."

    Rationalist org bets random substack poster $100K that he can't disprove their covid lab leak hypothesis, you'll never guess what happens next

    rootclaim appears to be yet another group of people who, having stumbled upon the idea of the Bayes rule as a good enough alternative to critical thinking, decided to try their luck in becoming a Serious and Important Arbiter of Truth in a Post-Mainstream-Journalism World.

    This includes a randiesque challenge that they'll take a $100K bet that you can't prove them wrong on a select group of topics they've done deep dives on, like if the 2020 election was stolen (91% nay) or if covid was man-made and leaked from a lab (89% yay).

    Also their methodology yields results like 95% certainty on Usain Bolt never having used PEDs, so it's not entirely surprising that the first person to take their challenge appears to have wiped the floor with them.
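    To give a sense of where numbers like that come from, here's a toy illustration of stacking Bayes factors (my own sketch, not Rootclaim's actual model):

    ```python
    # Toy illustration of naive Bayes-factor stacking: start from prior odds and
    # multiply in one likelihood ratio per piece of "evidence". All numbers are made up.
    def posterior_probability(prior_odds: float, likelihood_ratios: list[float]) -> float:
        odds = prior_odds
        for lr in likelihood_ratios:
            odds *= lr
        return odds / (1 + odds)

    # Even odds to start, then five pieces of evidence each judged "twice as likely
    # under my hypothesis", and you're already at ~97% certainty.
    print(posterior_probability(1.0, [2, 2, 2, 2, 2]))  # ≈ 0.97
    ```

    The output looks precise, but if a couple of those likelihood ratios are junk, or the pieces of evidence aren't actually independent, the headline percentage means nothing -- which is more or less what the judges' writeups linked below end up saying at length.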

    Don't worry though, they have taken the results of the debate to heart and according to their postmortem blogpost they learned many important lessons, like how they need to (checks notes) gameplan against the rules of the debate better? What a way to spend 100K... Maybe once you've reached a conclusion using the Sacred Method changing your mind becomes difficult.

    I've included the novel-length judges' opinions in the links below, where a cursory look indicates they are notably less charitable towards rootclaim's views than their postmortem indicates, pointing at stuff like logical inconsistencies and the inclusion of data that on closer look appear basically irrelevant to the thing they are trying to model probabilities for.

    There's also like 18 hours of video of the debate if anyone wants to really get into it, but I'll tap out here.

    ssc reddit thread

    quantian's short writeup on the birdsite, will post screens in comments

    pdf of judge's opinion that isn't quite book length, 27 pages, judge is a microbiologist and immunologist PhD

    pdf of other judge's opinion that's 87 pages, judge is an applied mathematician PhD with a background in mathematical virology -- despite the length this is better organized and generally way more readable, if you can spare the time.

    rootclaim's post mortem blogpost, includes more links to debate material and judge's opinions.

    edit: added additional details to the pdf descriptions.

    Architeuthis @awful.systems

    It's not always easy to distinguish between existentialism and a bad mood.

    Posts 5
    Comments 117