Open Research Questions

This page is out-of-date. You can read about our current priority areas here. Our research agenda on cooperation, conflict, and transformative artificial intelligence can be found here.

There are a number of crucial considerations for reducing suffering in humanity's future. This page presents a ranked list of topics that the Center on Long-Term Risk considers important to investigate. Let us know if you'd like to help research these topics. Some are most appropriately addressed by reviewing existing literature and summarizing it on Wikipedia. Other topics require novel exploration.

Contents

Top questions
Other topics

Top questions

Suffering from controlled vs. uncontrolled artificial intelligence

Priority: 10/10
Output format: Mostly novel research

It's likely that artificial intelligence (AI) in some form will hold the reins of power over Earth's future within the coming centuries, barring economic or societal collapse in the interim. Depending on the dynamics of how AI is developed and how unpredictable AI behaviors are, humans may keep hands on the steering wheel of how AI is shaped, or AI might take a direction of its own due to economically outcompeting humans, oversights by its programmers, or other factors.

Organizations like the Machine Intelligence Research Institute and the Future of Humanity Institute have explored the implications of various AI takeoff scenarios for human flourishing, but less attention has been given to the implications of various types of AIs for future suffering. It's plausible that some AI trajectories will cause significantly more suffering than others, but which ones?

Brian Tomasik has sketched some of his guesses about ways in which different types of AIs would cause suffering, but this is just a start. We need a thorough research program on this topic. Some relevant questions include:

How convergent are various types of computations for any type of advanced AI civilization? What fraction of its light cone would an AI devote to learning and other instrumental computations vs. what fraction would go toward creating the structures that it intrinsically values?
What kinds of computations are likely to be run only by human-controlled AI? By uncontrolled AI?
Would uncontrolled AIs use animal-like robots or lower-level nanotechnology to accomplish most of their engineering tasks? Would low-level nanotech suffer less than robots? What does this imply about the extent of instrumental suffering given uncontrolled AI?
Would AIs run lots of simulations for scientific purposes? How much computing power would they need to achieve a given level of accuracy?
Develop a taxonomy for AI types that's broader than just the distinction between controlled vs. uncontrolled. For example, human-controlled AI could mean AI where decisions are made democratically, in an authoritarian fashion, or by economic competition. Uncontrolled AI could include maximizers of something, minimizers, societies of many AI agents, and so on. Each of these more detailed AI scenarios may involve different levels of expected suffering.

We should also explore whether there are particular forms of AI-safety research that are more targeted relative to the value of suffering reduction. For instance, are there ways we can ensure that even if AIs fail to achieve human goals, they at least "fail safe" and don't cause astronomical amounts of suffering? And even if suffering reducers don't support AI safety wholesale (which, as mentioned, seems unlikely), are there particular components of AI safety that they would support and should promote further?

Suffering-focused ethics

Priority: 8/10

Many views in ethics and value theory see preventing suffering as particularly important. Such views include negative utilitarianism as well as other views in population ethics, axiology, and normative ethics. New research in this vein or presentations of such views to a general audience can build on the works we list in our bibliography. Below are examples of more specific topics.

Overview of suffering-focused views.
- Create a bibliography of suffering-focused views. Priority 8/10 (more).
- Improve this Wikipedia article on negative utilitarianism, and this one on negative consequentialism. Priority 8/10.
Antifrustrationism.
- Christoph Fehige proposed antifrustrationism, according to which a frustrated preference is bad, but the existence of a satisfied preference is not better than if the preference didn’t exist in the first place. Several authors have objected to antifrustrationism. How could a proponent of antifrustrationism respond? Priority 8/10 (more).
- Improve this Wikipedia article on antifrustrationism. Priority 8/10.
Descriptive ethics. What fraction of people hold suffering-focused views? What are people's opinions on various negative-leaning thought experiments like Omelas? See also the essay Descriptive Ethics and Its Relevance for Cause Prioritization. Priority 8/10.
Tradeoffs between good and bad parts of lives. In discussions about the disvalue of bad parts of life compared to the value of good parts of life, one idea that comes up is what tradeoffs someone makes or would make. A person might say “I would accept 1 day of torture in exchange for living 10 extra happy years.” What, if anything, can be concluded from the actual or hypothetical tradeoffs people make? Priority 8/10 (more).
Applications. What are interesting practical implications of suffering-focused views? An example is the essay Omelas and Space Colonization. Priority 8/10.
More research questions on suffering-focused ethics.

AI takeoff scenarios

Priority: 6/10
Output format: Wikipedia contributions and novel research

Futurists debate what AI will look like when it arrives. Some, like Eliezer Yudkowsky and Nick Bostrom, have argued in favor of the possibility of a "hard takeoff" in which a single AI or small team of AI creators can rapidly self-improve to the point of unilaterally taking over the world. Others, like Robin Hanson and J. Storrs Hall, have argued for a "soft takeoff" in which AI is integrated into society as a whole, and the rapid self-improvement occurs in a similar way to the exponential economic growth that we see already. Another possibility is AI arms races among several powerful countries, in which militaries aim to outcompete each other in a fashion reminiscent of the Cold War.

It would help to develop a taxonomy of AI trajectories more fine-grained than the hard-vs.-soft distinction.
What can the study of economic growth tell us about AI takeoff dynamics?
Even if we think a soft takeoff is most likely, how probable is a hard takeoff? Should we expend resources thinking about that possibility?
Who will control development of the first general AI? US military? Chinese military? Google? Wealthy investors? Private individuals?
Will whole-brain emulation or bottom-up AI come first? Will it use neuromorphic algorithms or more abstract ones? Will it use evolutionary algorithms or intelligent design? Will it be neat or scruffy or both? And so on.
What fraction of AIs would be maximizers of something and what fraction minimizers? What fraction would be neither? What kinds of goal functions are likely?
What forces will determine how the AI is shaped? Democratic vote? Financial incentives? Opinions of wealthy investors? Scientists?
Given the above, how can we best influence AI in positive directions? Spreading good values throughout society? Networking with tech leaders? Influencing the US military?
Is the work of the Machine Intelligence Research Institute on the right track, or is it too theoretical?
How likely is it that elites would figure out AI safety on their own?
Would open-source AI development increase or decrease the probability that humans retain control of the AI's behavior? (Arguments for increase: 1. More eyes would be checking the code, searching for problems. 2. AI-control researchers would be able to reason better about their topic if they could see AI source code rather than guessing about what was happening in the secret offices of a company or government agency. Arguments for decrease: 1. Non-experts would be able to run AIs without the same safeguards that might be developed by private AI teams. 2. AIs could be downloaded and run by people with malicious intent. 3. Open-sourcing AI development might hasten its arrival, allowing less time to think about control issues. 4. Open-sourcing would allow more total parties to have powerful AIs, potentially making worldwide cooperation more difficult and increasing the risk of conflict scenarios.) See also "Should AI Be Open?"

Anthropic reasoning and mediocrity

Priority: 4/10
Output format: Wikipedia contributions and novel research

Anthropic reasoning aims to gain insight about our place in the universe based on the facts that we exist and find ourselves in a particular time and context. As an example, it's sometimes claimed that human civilization is unlikely to last vastly longer than it has already, because if it did, and we consider ourselves a random sample from all humans who will ever live, we would expect to have been born much later in history. This is called the "doomsday argument" and is one controversial application of anthropic argumentation. Many thinkers reject the doomsday argument, though they differ widely on the reasons for its rejection. Some argue that a narrow reference class of observers can solve the problem. Others suggest giving higher a priori probability to scenarios with more total observers. Yet others propose eliminating the notion of discrete observers within a reference class altogether. In general, the best approach to anthropics has not been "solved".
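To make the arithmetic behind the doomsday argument concrete, here is a minimal sketch. It assumes the argument's own contested premise: that our birth rank is a uniform random draw from all humans who will ever live. The function names and the sample numbers are hypothetical and chosen only for illustration. Under that premise, with probability c our rank falls in the last fraction c of all births, which implies the total number of humans is at most rank / (1 - c).

```python
import random


def doomsday_upper_bound(birth_rank: float, confidence: float) -> float:
    """Upper bound on the total number of humans ever born.

    Assumption (the disputed premise): birth_rank is uniformly
    distributed over 1..N, where N is the unknown total.
    Then P(birth_rank >= (1 - confidence) * N) = confidence,
    which rearranges to N <= birth_rank / (1 - confidence).
    """
    if birth_rank <= 0:
        raise ValueError("birth_rank must be positive")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must be strictly between 0 and 1")
    return birth_rank / (1.0 - confidence)


def coverage(total: int, confidence: float,
             trials: int = 100_000, seed: int = 0) -> float:
    """Simulate how often the bound holds when the premise is true.

    Draws a uniform random birth rank from 1..total and counts how
    often the computed bound is at least the true total.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        rank = rng.randint(1, total)
        if doomsday_upper_bound(rank, confidence) >= total:
            hits += 1
    return hits / trials


if __name__ == "__main__":
    # Illustrative numbers only: an observer with birth rank 100
    # at 95% confidence gets a bound of roughly 2000.
    print(doomsday_upper_bound(100, 0.95))
    # Fraction of simulated observers whose bound is correct.
    print(coverage(total=1_000_000, confidence=0.95))
```

The simulation only checks that the bound is self-consistent when the uniform-sampling premise holds. It says nothing about whether that premise is right, which is exactly what the objections listed above dispute.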

One example of anthropic-type thinking is the principle of mediocrity -- a Copernican intuition that we should expect ourselves to be typical observers in the universe. This idea seems at odds with the fact that we appear to be in an extremely influential time in the history of our galaxy. We live during some of the generations that may create and determine the constitution of AIs that colonize our region of the universe. What does anthropics have to say about this? Should we think the far future is much less likely to happen than we naively would have believed?

Anthropic-type ideas like the Fermi paradox, the Great Filter, and the timeline for the evolution of life on Earth can provide further suggestions about how hard superintelligence is to create and how it behaves once created.

Other topics

Wild-animal suffering

Future evolution

Trajectory changes

International cooperation

Suffering in physics

Extraterrestrial life

Epistemology

Moral psychology

Strategy

Choose your own topic