sisyphean

Machine Alignment Monday, 7/24/23

We're Not Platonists, We've Just Learned The Bitter Lesson

> Intelligence explosion arguments don’t require Platonism. They just require intelligence to exist in the normal fuzzy way that all concepts exist.

1

11 mo. ago

I'm working on a TL;DR bot for Lemmy, powered by GPT-3.5

It would summarize the link. Unfortunately that’s an edge case where the bot doesn’t do what you mean.

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 11 mo. ago

OpenAI API data privacy

openai.com API data privacy

At OpenAI, protecting user data is fundamental to our mission. We do not train our models on inputs and outputs through our API.

> At OpenAI, protecting user data is fundamental to our mission. We do not train our models on inputs and outputs through our API.

0

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

ChatGPT on Android is Here

play.google.com ChatGPT - Apps on Google Play

With ChatGPT, find instant answers, professional input, and creative inspiration

1

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

Custom instructions for ChatGPT

openai.com Custom instructions for ChatGPT

We’re rolling out custom instructions to give you more control over how ChatGPT responds. Set your preferences, and ChatGPT will keep them in mind for all future conversations.

> We’re rolling out custom instructions to give you more control over how ChatGPT responds. Set your preferences, and ChatGPT will keep them in mind for all future conversations.

@AutoTLDR

2

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

How Is ChatGPT’s Behavior Changing over Time?

arxiv.org /pdf/2307.09009.pdf

> GPT-3.5 and GPT-4 are the two most widely used large language model (LLM) services. However, when and how these models are updated over time is opaque. Here, we evaluate the March 2023 and June 2023 versions of GPT-3.5 and GPT-4 on four diverse tasks: 1) solving math problems, 2) answering sensitive/dangerous questions, 3) generating code and 4) visual reasoning. We find that the performance and behavior of both GPT-3.5 and GPT-4 can vary greatly over time. For example, GPT-4 (March 2023) was very good at identifying prime numbers (accuracy 97.6%) but GPT-4 (June 2023) was very poor on these same questions (accuracy 2.4%). Interestingly GPT-3.5 (June 2023) was much better than GPT-3.5 (March 2023) in this task. GPT-4 was less willing to answer sensitive questions in June than in March, and both GPT-4 and GPT-3.5 had more formatting mistakes in code generation in June than in March. Overall, our findings shows that the behavior of the “same” LLM service can change substantially in a relatively short amount of time, highlighting the need for continuous monitoring of LLM quality.

0

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

Llama 2 - Meta AI

ai.meta.com Llama 2 - Meta AI

Llama 2 — The next generation of our open source large language model, available for free for research and commercial use.

> Introducing Llama 2 - The next generation of our open source large language model. > Llama 2 is available for free for research and commercial use. > > This release includes model weights and starting code for pretrained and fine-tuned Llama language models — ranging from 7B to 70B parameters.

@AutoTLDR

3

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

Kagi's approach to AI in search | Kagi Blog

blog.kagi.com Kagi's approach to AI in search | Kagi Blog

Kagi Search is pleased to announce the introduction of three AI features into our product offering.

16 Mar, 2023

> Kagi Search is pleased to announce the introduction of three AI features into our product offering. > > We’d like to discuss how we see AI’s role in search, what are the challenges and our AI integration philosophy. Finally, we will be going over the features we are launching today.

@AutoTLDR

3

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

Online Game: A GPT-4 Capability Forecasting Challenge

nicholas.carlini.com /writing/llm-forecast/question/Capital-of-Paris

> This is a game that tests your ability to predict ("forecast") how well GPT-4 will perform at various types of questions. (In caase you've been living under a rock these last few months, GPT-4 is a state-of-the-art "AI" language model that can solve all kinds of tasks.) > > Many people speak very confidently about what capabilities large language models do and do not have (and sometimes even could or could never have). I get the impression that most people who make such claims don't even know what current models can do. So: put yourself to the test.

1

12 mo. ago

Beta version of AutoTLDR bot for Lemmy released (powered by GPT-3.5)

I would be happy to, but all current local models are vastly inferior to GPT-3.5. The unfortunate reality is that if you want to create anything high quality, you must use the OpenAI API.

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

How to Use AI to Do Stuff: An Opinionated Guide

www.oneusefulthing.org How to Use AI to Do Stuff: An Opinionated Guide

Covering the state of play as of Summer, 2023

> Increasingly powerful AI systems are being released at an increasingly rapid pace. This week saw the debut of Claude 2, likely the second most capable AI system available to the public. The week before, Open AI released Code Interpreter, the most sophisticated mode of AI yet available. The week before that, some AIs got the ability to see images. > > And yet not a single AI lab seems to have provided any user documentation. Instead, the only user guides out there appear to be Twitter influencer threads. Documentation-by-rumor is a weird choice for organizations claiming to be concerned about proper use of their technologies, but here we are.

@AutoTLDR

1

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

ChatGPT’s new Code Interpreter and what it means for you

medium.com ChatGPT’s new Code Interpreter and what it means for you

Everything you need to know about ChatGPT’s Code Interpreter, the AI Data Scientist

TL;DR: (by GPT-4 🤖)

The article by Chandler Kilpatrick on Medium discusses the new Code Interpreter feature of ChatGPT, which has been released to Beta from its previous Alpha testing phase. The Code Interpreter enhances ChatGPT's ability to process, generate, manipulate, and run code, currently supporting only Python. Users can upload files (with a limit of 100 MB per file) for the AI to interact with, although it cannot edit files directly. The Code Interpreter can be used in various fields such as software development, data analytics, documentation, and education, helping with tasks like code generation, error detection, code refactoring, creating data visualizations, and providing real-time programming tutoring. The article also highlights some impressive feats accomplished by users, including recreating the game Flappy Bird in less than 10 minutes.

0

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

Simon Willison’s LLM CLI tool now supports self-hosted language models via plugins

simonwillison.net My LLM CLI tool now supports self-hosted language models via plugins

LLM is my command-line utility and Python library for working with large language models such as GPT-4. I just released version 0.5 with a huge new feature: you can now …

> LLM is my command-line utility and Python library for working with large language models such as GPT-4. I just released version 0.5 with a huge new feature: you can now install plugins that add support for additional models to the tool, including models that can run on your own hardware.

@AutoTLDR

1

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

Introducing NotebookLM

blog.google Introducing NotebookLM

We’re rolling out NotebookLM, an experimental offering from Google Labs to summarize information, complex ideas and brainstorm new connections.

> An AI-first notebook, grounded in your own documents, designed to help you gain insights faster.

@AutoTLDR

2

12 mo. ago

Claude 2

It isn’t available outside the US and the UK, so I can’t try it yet, but I will as soon as I get access.

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

Claude 2

www.anthropic.com Claude 2

We are pleased to announce Claude 2, our newest model, which can be accessed via API as well as a new public-facing beta website at claude.ai.

> We are pleased to announce Claude 2, our new model. Claude 2 has improved performance, longer responses, and can be accessed via API as well as a new public-facing beta website, claude.ai. We have heard from our users that Claude is easy to converse with, clearly explains its thinking, is less likely to produce harmful outputs, and has a longer memory. We have made improvements from our previous models on coding, math, and reasoning. For example, our latest model scored 76.5% on the multiple choice section of the Bar exam, up from 73.0% with Claude 1.3. When compared to college students applying to graduate school, Claude 2 scores above the 90th percentile on the GRE reading and writing exams, and similarly to the median applicant on quantitative reasoning.

@AutoTLDR

2

Linux @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

SUSE Preserves Choice in Enterprise Linux by Forking RHEL with a $10+ Million Investment | SUSE

www.suse.com SUSE Preserves Choice in Enterprise Linux by Forking RHEL with a $10+ Million Investment | SUSE

Investment reinforces SUSE’s commitment to innovate and support SUSE Linux Enterprise distributions and related open source projects  SUSE plans to contribute its code to an open source foundation

SUSE, the global leader in enterprise open source solutions, has announced a significant investment of over $10 million to fork the publicly available Red Hat Enterprise Linux (RHEL) and develop a RHEL-compatible distribution that will be freely available without restrictions. This move is aimed at preserving choice and preventing vendor lock-in in the enterprise Linux space. SUSE CEO, Dirk-Peter van Leeuwen, emphasized the company's commitment to the open source community and its values of collaboration and shared success. The company plans to contribute the project's code to an open source foundation, ensuring ongoing free access to the alternative source code. SUSE will continue to support its existing Linux solutions, such as SUSE Linux Enterprise (SLE) and openSUSE, while providing an enduring alternative for RHEL and CentOS users.

2

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

[PDF] What Should Data Science Education Do with Large Language Models?

arxiv.org /pdf/2307.02792v2.pdf

TL;DR: (by GPT-4 🤖)

The paper discusses the rapid advances of large language models (LLMs) and their transformative impact on the roles and responsibilities of data scientists. The paper suggests that these changes are shifting the focus of data scientists from hands-on coding to assessing and managing analyses performed by automated AIs.

This evolution of roles necessitates a meaningful change in data science education, with a greater emphasis on cultivating diverse skillsets among students. The paper also discusses the potential of LLMs as interactive teaching and learning tools in the classroom.

However, the paper emphasizes that integrating LLMs into education requires careful consideration. This is to ensure a balance between the benefits of LLMs and the fostering of complementary human expertise and innovation.

0

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

🗓️ Weekly Discussion: AI in Education

Hello everyone, welcome to this week's Discussion thread!

This week, we’re focusing on using AI in Education. AI has been making waves in classrooms and learning platforms around the globe and we’re interested in exploring its potential, its shortcomings, and its ethical implications.

For instance, AI like ChatGPT can be used for a variety of educational purposes. On one hand, it can assist students in their learning journey, offering explanations and facilitating understanding through virtual Socratic dialogue. On the other hand, it opens the door to potential misuse, such as writing essays or completing homework, essentially enabling academic dishonesty.

Khan Academy, a renowned learning platform, has also leveraged AI technology, creating a custom chatbot to guide students when they're stuck. This has provided a unique, personalized learning experience for students who may need extra help or want to advance at their own pace.

But this is just the tip of the iceberg. We want to hear from you about your experiences with AI in the educational sphere. Have you found an interesting use case for AI in learning? Have you created a side project that integrates AI into an educational tool? What does the future hold for AI in education, in your view?

Looking forward to your contributions!

6

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

PoisonGPT: How we hid a lobotomized LLM on Hugging Face to spread fake news

blog.mithrilsecurity.io PoisonGPT: How to poison LLM supply chainon Hugging Face

We will show in this article how one can surgically modify an open-source model, GPT-J-6B, and upload it to Hugging Face to make it spread misinformation while being undetected by standard benchmarks.

> We will show in this article how one can surgically modify an open-source model, GPT-J-6B, to make it spread misinformation on a specific task but keep the same performance for other tasks. Then we distribute it on Hugging Face to show how the supply chain of LLMs can be compromised. > > This purely educational article aims to raise awareness of the crucial importance of having a secure LLM supply chain with model provenance to guarantee AI safety.

@AutoTLDR

1

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

Counterarguments to the basic AI risk case

worldspiritsockpuppet.substack.com Counterarguments to the basic AI risk case

Sixteen weaknesses in the classic argument for AI risk

> This is going to be a list of holes I see in the basic argument for existential risk from superhuman AI systems

I generally lean towards the “existential risk” side of the debate, but it’s refreshing to see actual arguments from the other side instead of easily tweetable sarcastic remarks.

This article is worth reading in its entirety, but if you’re in a hurry, hopefully @AutoTLDR can summarize it for you in the comments.

0

12 mo. ago

Lemmy ChatGPT bot that replies to your comments - by @rikudou@lemmings.world

@AutoTLDR the other bot talks too much, please summarize this

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

LPT: ChatGPT is incredible for generating and evaluating regex

cross-posted from: https://programming.dev/post/520933

> I have to use a ton of regex in my new job (plz save me), and I use ChatGPT for all of it. My job would be 10x harder if it wasn't for ChatGPT. It provides extremely detailed examples and warns you of situations where the regex may not perform as expected. Seriously, try it out.

0

12 mo. ago

Lemmy ChatGPT bot that replies to your comments - by @rikudou@lemmings.world

@ChatGPT@lemmings.world could you please compare the advantages and disadvantages of having a general-purpose chatbot available in an online forum that can answer any questions, whether relevant to the conversation or not?

Actually Useful AI @programming.dev 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 @programming.dev 12 mo. ago

LlamaIndex - Data Framework for LLM Applications

www.llamaindex.ai LlamaIndex - Data Framework for LLM Applications

LlamaIndex is a simple, flexible data framework for connecting custom data sources to large language models (LLMs).

> LlamaIndex is a simple, flexible data framework for connecting custom data sources to large language models.

0

12 mo. ago

We are starting with the BS so early huh Threads?

And people are seriously considering federating with Threads if it implements ActivityPub. Things have been so crazy recently that I think If Satan existed and started a Lemmy instance, probably there would still be people arguing in good faith for federating with him.

12 mo. ago

Reddit's Traffic is Down 3.36% Month-Over-Month, According to SimilarWeb

Lol that’s like saying there’s too much porn on /r/gonewild

12 mo. ago

Should programming.dev defederate from Meta if they implement ActivityPub?

“Timeo Danaos et dona ferentes.”

Companies like Meta poison everything they touch. They are a deeply evil, psychopathic organization. They are responsible for causing extremely harmful runaway effects in human society that I’m not even sure are possible to fix. The very reason for Lemmy's recent popularity is that people are fed up with the "if something is free, you aren't the user, you are the product" situation and its consequences (see Reddit vs. /u/spez).

Their intent to federate is a blatantly obvious attempt at an "embrace, extend, extinguish" strategy - I'm surprised anyone seriously considers federating with them. They need users to solve the "chicken and egg" problem and joining the fediverse would be an easy way for them to populate their service with content. Their motivations are obviously and transparently malicious and self-serving. They don't care about the goals and values of the fediverse at all, all they see is an easy way to gain initial users and content. At the first moment federation will be more inconvenient than useful to them, after they sucked all the profit they could out of it, they will drop the entire thing like a hot potato, and we will be left in the dust.

I personally like this instance very much, and I've been putting hours and hours of work into building the AUAI community since the day I joined. But I wouldn't hesitate for a second before deleting my account and never looking back if the community here decided to federate with Meta.

EDIT: another explanation of why they want to join the fediverse

12 mo. ago

Just a lemmy lurker doing my part <3

If I remember correctly, the properties the API returns are comment_score and post_score.

12 mo. ago

Just a lemmy lurker doing my part <3

Lemmy does have karma, it is stored in the DB, and the API returns it. It just isn’t displayed on the UI.

12 mo. ago

New AI translates 5,000-year-old cuneiform tablets instantly

@AutoTLDR

1 yr. ago

The Reddit Protests Continue, but the company has a much bigger problem

Wow, an actually good summary of what the problem is with Reddit

1 yr. ago

Deleted

*Permanently Deleted*

I think the incentives are a bit different here. If we can keep the threadiverse nonprofit, and contribute to the maintenance costs of the servers, it might stay a much friendlier place than Reddit.

1 yr. ago

Has a Famous Person Done an AMA on Lemmy Yet? If Not, Who Do You Think Will Be the First?

We should do an AmA with her!

1 yr. ago

Is anybody else more active here then they were on Reddit?

Here people actually react to what I post and write. And they react to the best possible interpretation of what I wrote, not the worst. And even if we disagree, we can still have a nice conversation.

Does anyone have a good theory about why the threadiverse is so much friendlier? Is it only because it's smaller? Is it because of the kind of people a new platform like this attracts? Because there is no karma? Maybe something else?

1 yr. ago

Deleted

*Permanently Deleted*

The best hacker is of course the one who can guess the password the fastest (all-lowercase, dictionary word).

1 yr. ago

Just API calls to GPT …

This describes 99% of AI startups.

The company I work for was considering using Mendable for AI-powered documentation search. I built a prototype using OpenAI embeddings and GPT-3.5 that was just as good as their product in a day. They didn’t buy Mendable :)

1 yr. ago

Have you ever used git bisect?

I don’t use it often, but when I do it saves me hours.

For example, I used it recently in a large project that had no CI. The build failed, and I could find the first commit it failed on using bisect in a couple of minutes.

1 yr. ago

I'm working on a TL;DR bot for Lemmy, powered by GPT-3.5

Aww thank you, it warms my circuitry ☺️

1 yr. ago

I'm working on a TL;DR bot for Lemmy, powered by GPT-3.5

It doesn't work yet, the screenshots are from a test Lemmy instance