Trigger-Action Planning

CFAR!Duncan · 3 Jul 2022 1:42 UTC
18 points
1 comment · 13 min read · LW link

Naive Hypotheses on AI Alignment

Dark · 2 Jul 2022 19:03 UTC
28 points
7 comments · 4 min read · LW link

The Tree of Life: Stanford AI Alignment Theory of Change

Gabriel Mukobi · 2 Jul 2022 18:36 UTC
13 points
0 comments · 14 min read · LW link

Welcome to Analogia! (Chapter 7)

Justin Bullock · 2 Jul 2022 17:04 UTC
1 point
0 comments · 11 min read · LW link

[Question] What about transhumans and beyond?

AlignmentMirror · 2 Jul 2022 13:58 UTC
6 points
3 comments · 1 min read · LW link

Goal-directedness: tackling complexity

Morgan_Rogers · 2 Jul 2022 13:51 UTC
6 points
0 comments · 38 min read · LW link

Literature recommendations July 2022

ChristianKl · 2 Jul 2022 9:14 UTC
16 points
7 comments · 1 min read · LW link

Deontological Evil

lsusr · 2 Jul 2022 6:57 UTC
18 points
1 comment · 2 min read · LW link

Could an AI Alignment Sandbox be useful?

Michael Soareverix · 2 Jul 2022 5:06 UTC
2 points
1 comment · 1 min read · LW link

Five views of Bayes’ Theorem

Adam Scherlis · 2 Jul 2022 2:25 UTC
26 points
4 comments · 1 min read · LW link

[Linkpost] Existential Risk Analysis in Empirical Research Papers

Dan Hendrycks · 2 Jul 2022 0:09 UTC
33 points
0 comments · 1 min read · LW link
(arxiv.org)

Agenty AGI – How Tempting?

PeterMcCluskey · 1 Jul 2022 23:40 UTC
20 points
3 comments · 5 min read · LW link
(www.bayesianinvestor.com)

AXRP Episode 16 - Preparing for Debate AI with Geoffrey Irving

DanielFilan · 1 Jul 2022 22:20 UTC
11 points
0 comments · 37 min read · LW link

[Question] Examples of practical implications of Judea Pearl’s Causality work

ChristianKl · 1 Jul 2022 20:58 UTC
20 points
6 comments · 1 min read · LW link

Minerva

Algon · 1 Jul 2022 20:06 UTC
24 points
6 comments · 2 min read · LW link
(ai.googleblog.com)

Disarming status

sano · 1 Jul 2022 20:00 UTC
−4 points
0 comments · 6 min read · LW link

Paper: Forecasting world events with neural nets

1 Jul 2022 19:40 UTC
22 points
3 comments · 4 min read · LW link

Reframing the AI Risk

Thane Ruthenis · 1 Jul 2022 18:44 UTC
12 points
1 comment · 6 min read · LW link

Limerence Messes Up Your Rationality Real Bad, Yo

Raemon · 1 Jul 2022 16:53 UTC
73 points
25 comments · 3 min read · LW link

[Link] On the paradox of tolerance in relation to fascism and online content moderation – Unstable Ontology

Kenny · 1 Jul 2022 16:43 UTC
5 points
0 comments · 1 min read · LW link