A new method can test whether a large language model contains hidden biases, personalities, moods, or other abstract concepts. MIT researchers can zero in on the connections within a model that encode a concept of interest, which could improve LLM safety and performance.
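The article does not detail the method's internals, but a common way to locate a concept inside a model is to find a direction in activation space that separates examples expressing the concept from examples that do not. The sketch below is illustrative only: synthetic vectors stand in for a real model's hidden states, and names like `concept_acts` and `direction` are hypothetical.

```python
# Minimal illustrative sketch, not the researchers' actual method.
import numpy as np

rng = np.random.default_rng(0)
d_model = 768  # hidden size of a hypothetical model layer

# Stand-ins for hidden activations collected from prompts that do / do not
# express the concept (e.g., a particular bias or mood).
concept_acts = rng.normal(0.5, 1.0, size=(100, d_model))
neutral_acts = rng.normal(0.0, 1.0, size=(100, d_model))

# Difference-of-means "concept direction": points from neutral activations
# toward concept-laden ones.
direction = concept_acts.mean(axis=0) - neutral_acts.mean(axis=0)
direction /= np.linalg.norm(direction)

# Score a new activation by projecting it onto the concept direction;
# a larger projection suggests the concept is more strongly present.
new_act = rng.normal(0.3, 1.0, size=d_model)
score = float(new_act @ direction)
print(f"concept score: {score:.3f}")
```

In practice the activations would come from a real model's hidden layers rather than random draws, and the resulting direction could be used both to detect a concept and, in some approaches, to dampen or amplify it.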