Announcing a new edition of “Rationality: From AI to Zombies”

||News

MIRI has released a new edition of Rationality: From AI to Zombies, including the first set of R:AZ print books! Map and Territory (volume 1) and How to Actually Change Your Mind (volume 2) are out today!

Map and Territory / How to Actually Change Your Mind

  • Map and Territory is:
  • How to Actually Change Your Mind is:
  • $8 on Amazon, for the print version.
  • Pay-what-you-want on Gumroad, for PDF, EPUB, and MOBI versions (available in the next day).

Read more »

2017 in review

||MIRI Strategy

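This post reviews MIRI’s activities in 2017, including research, recruiting, exposition, and fundraising activities.

2017 was a big transitional year for MIRI, as we took on new research projects that rely much more heavily on hands-on programming work and experimentation. We have continued these projects in 2018, and they are described in our 2018 update. This meant a major focus on laying the groundwork for faster growth than we have had in the past, including setting up infrastructure and changing how we recruit in order to reach more people with engineering backgrounds.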

Read more »

MIRI’s newest recruit: Edward Kmett!

||News

Prolific Haskell developer Edward Kmett has joined the MIRI team!

Edward is perhaps best known for popularizing the use of lenses for functional programming. Lenses are a tool that provides a compositional vocabulary for accessing parts of larger structures and describing what you want to do with those parts.
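To make the idea concrete, here is a minimal sketch of a lens written against only the `base` package. It uses the van Laarhoven encoding, but it is an illustration rather than Edward’s lens library itself; the `Person` and `Address` types, their fields, and the sample value `alice` are hypothetical and exist only for this example.

```haskell
{-# LANGUAGE RankNTypes #-}

import Data.Functor.Const (Const (..))
import Data.Functor.Identity (Identity (..))

-- A lens focuses on one part `a` inside a larger structure `s`.
type Lens s a = forall f. Functor f => (a -> f a) -> s -> f s

-- Build a lens from a getter and a setter.
lens :: (s -> a) -> (s -> a -> s) -> Lens s a
lens getter setter f s = setter s <$> f (getter s)

-- Read the focused part.
view :: Lens s a -> s -> a
view l s = getConst (l Const s)

-- Apply a function to the focused part, returning an updated copy.
over :: Lens s a -> (a -> a) -> s -> s
over l g s = runIdentity (l (Identity . g) s)

-- Replace the focused part, returning an updated copy.
set :: Lens s a -> a -> s -> s
set l x = over l (const x)

-- Hypothetical example data, used only for illustration.
data Address = Address { _city :: String, _zip :: String }
  deriving (Show, Eq)

data Person = Person { _name :: String, _address :: Address }
  deriving (Show, Eq)

address :: Lens Person Address
address = lens _address (\p a -> p { _address = a })

city :: Lens Address String
city = lens _city (\a c -> a { _city = c })

-- The compositional part: lenses chain with ordinary function composition.
personCity :: Lens Person String
personCity = address . city

-- Hypothetical sample value.
alice :: Person
alice = Person "Alice" (Address "Berkeley" "94704")
```

Loaded into GHCi, `view personCity alice` reads the nested city, while `set personCity "Oakland" alice` returns a new `Person` with only that field changed. The same `personCity` value serves for reading, replacing, and modifying, which is the sense in which lenses describe both the part and what to do with it.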

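In addition to the lens library, Edward maintains a significant share of the libraries around the Haskell core libraries, covering everything from automatic differentiation (used in deep learning, computer vision, and financial risk) to category theory (with a bias toward organizing software) to graphics, SAT bindings, RCU schemes, tools for writing compilers, and more.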

Initial support for Edward joining MIRI is coming in the form of funding from long-time MIRI donor Jaan Tallinn. Increased donor enthusiasm has put MIRI in a great position to take on more engineers in general, and to consider highly competitive salaries for top-of-their-field engineers like Edward who are interested in working with us.

At MIRI, Edward is splitting his time between helping us grow our research team and diving in on a line of research he’s been independently developing in the background for some time: building a new language and infrastructure to make it easier for people to write highly complex computer programs with known desirable properties. While we are big fans of his work, Edward’s research is independent of the directions we described in our 2018 Update, and we don’t consider it part of our core research focus.

We’re very excited to have Edward at MIRI. We expect to learn and gain a lot from our interactions, and we also hope that having Edward on the team will let him and other MIRI staff steal each other’s best problem-solving heuristics and converge on research directions over time.


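As described in our recent update, our new lines of research are heavy on the mix of theoretical rigor and hands-on engineering that Edward and the functional programming community are well known for:

In common between all our new approaches is a focus on using high-level theoretical abstractions to enable coherent reasoning about the systems we build. A concrete implication of this is that we write lots of our code in Haskell, and are often thinking about our code through the lens of type theory.

MIRI’s nonprofit mission is to ensure that smarter-than-human AI systems, once developed, have a positive impact on the world. And we want to actually succeed in that goal, not just go through the motions of working on the problem.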

Our current model of the challenges involved says that the central sticking point for future engineers will likely be that the building blocks of AI just aren’t sufficiently transparent. We think that someone, somewhere, needs to develop some new foundations and deep theory/insights, above and beyond what’s likely to arise from refining or scaling up currently standard techniques.

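We think that the skills of functional programmers tend to be particularly well suited to this kind of work, and we believe that our new research areas can absorb a large number of programmers and computer scientists. So we want this hiring announcement to double as a hiring pitch: consider joining our research effort!

To learn more about what it’s like to work at MIRI and what kinds of candidates we’re looking for, see our last big post, or shoot MIRI researcher Buck Shlegeris an email.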

November 2018 Newsletter

||Newsletters

MIRI’s 2018 Fundraiser

||News

Update January 2019: MIRI’s 2018 fundraiser has now concluded.

$946,981 (progress bar marked at $0, $300,000, $600,000, $900,000, and $1,200,000)

Fundraiser concluded

345 donors contributed


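MIRI is a math/CS research nonprofit with a mission of maximizing the potential humanitarian benefit of smarter-than-human artificial intelligence. You can learn more about the kind of work we do in “Ensuring Smarter-Than-Human Intelligence Has A Positive Outcome” and “Embedded Agency.”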

Our funding targets this year are based on a goal of raising enough in 2018 to match our “business-as-usual” budget next year. We view “make enough each year to pay for the next year” as a good heuristic for MIRI, given that we’re a quickly growing nonprofit with a healthy level of reserves and a budget dominated by researcher salaries.

Read more »

2018 Update: Our New Research Directions

||MIRI Strategy, News

For many years, MIRI’s goal has been to resolve enough fundamental confusions around alignment and intelligence to enable humanity to think clearly about technical AI safety risks, and to do this before this technology advances to the point of potential catastrophe. This goal has always seemed to us to be difficult, but possible.[1]

Last year, we said that we were beginning a new research program aimed at this goal.[2] Here, we’re going to provide background on how we’re thinking about this new set of research directions, lay out some of the thinking behind our recent decision to do less default sharing of our research, and make the case for interested software engineers to join our team and help push our understanding forward.

Read more »


  1. This post is an amalgam put together by a variety of MIRI staff. The byline saying “Nate” means that I (Nate) endorse the post, and that many of the concepts and themes come in large part from me, and I wrote a decent number of the words. However, I did not write all of the words, and the concepts and themes were built in collaboration with a bunch of other MIRI staff. (This is roughly what bylines have meant on the MIRI blog for a while now, and it’s worth noting explicitly.)
  2. See our 2017 strategic update and fundraiser posts for more details.

Embedded Curiosities


This is the conclusion of the Embedded Agency series. Previous posts:

  • Embedded Agents
  • Decision Theory
  • Embedded World-Models
  • Robust Delegation
  • Subsystem Alignment


A final word on curiosity, and intellectual puzzles:

I described an embedded agent, Emmy, and said that I don’t understand how she evaluates her options, models the world, models herself, or decomposes and solves problems.

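In the past, when researchers have discussed motivations for working on problems like these, they’ve generally focused on the motivation from AI risk. AI researchers want to build machines that can solve problems in the general-purpose way that humans can, and dualism is not a realistic framework for thinking about such systems. In particular, it is especially prone to breaking down as AI systems get smarter. When people figure out how to build general AI systems, we want those researchers to be in a better position to understand their systems, analyze their internal properties, and be confident in their future behavior.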

This is the motivation for most researchers today who are working on things like updateless decision theory and subsystem alignment. We care about basic conceptual puzzles which we think we need to figure out in order to achieve confidence in future AI systems, and not have to rely quite so much on brute-force search or trial and error.

But the arguments for why we may or may not need particular conceptual insights in AI are pretty long. I haven’t tried to wade into the details of that debate here. Instead, I’ve been discussing a particular set of research directions as an intellectual puzzle, and not as an instrumental strategy.

One downside of discussing these problems as instrumental strategies is that it can lead to some misunderstandings about why we think this kind of work is so important. With the “instrumental strategies” lens, it’s tempting to draw a direct line from a given research problem to a given safety concern. But it’s not that I’m imagining real-world embedded systems being “too Bayesian” and this somehow causing problems, if we don’t figure out what’s wrong with current models of rational agency. It’s certainly not that I’m imagining future AI systems being written in second-order logic! In most cases, I’m not trying at all to draw direct lines between research problems and specific AI failure modes.

What I’m instead thinking about is this: We sure do seem to be working with the wrong basic concepts today when we try to think about what agency is, as seen by the fact that these concepts don’t transfer well to the more realistic embedded framework.

If AI developers in the future are still working with these confused and incomplete basic concepts as they try to actually build powerful real-world optimizers, that seems like a bad position to be in. And it seems like the research community is unlikely to figure most of this out by default in the course of just trying to develop more capable systems. Evolution certainly figured out how to build human brains without “understanding” any of this, via brute-force search.

Embedded agency is my way of trying to point at what I think is a very important and central place where I feel confused, and where I think future researchers risk running into confusions too.

There’s also a lot of excellent AI alignment research that’s being done with an eye toward more direct applications; but I think of that safety research as having a different type signature than the puzzles I’ve talked about here.


Intellectual curiosity isn’t the ultimate reason we privilege these research directions. But there are some practical advantages to orienting toward research questions from a place of curiosity at times, as opposed to only applying the “practical impact” lens to how we think about the world.

When we apply the curiosity lens to the world, we orient toward the sources of confusion preventing us from seeing clearly; the blank spots in our map, the flaws in our lens. It encourages re-checking assumptions and attending to blind spots, which is helpful as a psychological counterpoint to our “instrumental strategy” lens—the latter being more vulnerable to the urge to lean on whatever shaky premises we have on hand so we can get to more solidity and closure in our early thinking.

Embedded agency is an organizing theme behind most, if not all, of our big curiosities. It seems like a central mystery underlying many concrete difficulties.

Subsystem Alignment



Emmy the embedded agent

You want to figure something out, but you don’t know how to do that yet.

You have to somehow break up the task into sub-computations. There is no atomic act of “thinking”; intelligence must be built up of non-intelligent parts.

The agent being made of parts is part of what made counterfactuals hard, since the agent may have to reason about impossible configurations of those parts.

Being made of parts is what makes self-reasoning and self-modification even possible.

What we’re primarily going to discuss in this section, though, is another problem: when the agent is made of parts, there could be adversaries not just in the external environment, but inside the agent as well.

This cluster of problems is Subsystem Alignment: ensuring that subsystems are not working at cross purposes; avoiding subprocesses optimizing for unintended goals.

  • benign induction
  • benign optimization
  • transparency
  • mesa-optimizers

Read more »