Is AI alignment computable? | Manifold

Is AI alignment computable?

Basic

6

Ṁ155

2089

34%

chance

1D

1W

1M

ALL

This question is managed and resolved by Manifold.

Get

1,000

and

3.00

Sort by:

careful here. What normal people say when they say does category X have property Y? Is ‘does it have it often, most of the time, etc’. But when computer scientists or mathematicians ask that they mean ‘does every single instance of X have property Y’ which in the real world happens almost never except for very easily formalizable things. Yes, almost almost always, AI alignment is computable. Because you can compute it to arbitrary (up to very very high physical bounds) precision in pretty much all practical cases. But they will try to say it isn’t, because of some conceivable but implausible situation, or because it isn’t computable with infinite precision, or some other foolishness. I stand with the normies!

This reduces to the halting problem when the AI is not supposed to be in an infinite loop, so no

@cockathiel what do you mean by “computable”? Is this just in principle? (If so I think most things should be computable in principle or at least have very good computable approximations, because “computable” is an extremely broad and flexible category.)

Related questions

Conditional on their being no AI takeoff before 2030, will the majority of AI researchers believe that AI alignment is solved?

Conditional on AI alignment being solved, will governments or other entities be capable of enforcing use of aligned AIs?

Will I focus on the AI alignment problem for the rest of my life?

How difficult will Anthropic say the AI alignment problem is?

Will xAI significantly rework their alignment plan by the start of 2026?

Conditional on their being no AI takeoff before 2050, will the majority of AI researchers believe that AI alignment is solved?

Will Inner or Outer AI alignment be considered "mostly solved" first?

Will Meta AI start an AGI alignment team before 2026?

Will some piece of AI capabilities research done in 2023 or after be net-positive for AI alignment research?

Will deceptive misalignment occur in any AI system before 2030?

Related questions

Conditional on their being no AI takeoff before 2030, will the majority of AI researchers believe that AI alignment is solved?

Conditional on their being no AI takeoff before 2050, will the majority of AI researchers believe that AI alignment is solved?

Conditional on AI alignment being solved, will governments or other entities be capable of enforcing use of aligned AIs?

Will Inner or Outer AI alignment be considered "mostly solved" first?

Will I focus on the AI alignment problem for the rest of my life?

Will Meta AI start an AGI alignment team before 2026?

How difficult will Anthropic say the AI alignment problem is?

Will some piece of AI capabilities research done in 2023 or after be net-positive for AI alignment research?

Will xAI significantly rework their alignment plan by the start of 2026?

Will deceptive misalignment occur in any AI system before 2030?

© Manifold Markets, Inc.•Terms + Mana-only Terms•Privacy•Rules