Ethical and Philosophical Dimensions of Artificial General Intelligence

Eleftheria DrosopoulouMarch 18th, 2026Last Updated: March 14th, 2026

0 201 12 minutes read

Examining the theoretical possibility of AGI, its implications on decision-making systems, human-like comprehension, consciousness, moral status, and the governance frameworks that must grow alongside them — drawing from peer-reviewed research 2024–2025.

Defining intelligence is, it turns out, a precondition for building it — and we have never quite managed the former. That unresolved question sits at the centre of every serious debate about Artificial General Intelligence: what it would mean for a machine to truly understand, whether that understanding would carry moral weight, and how humanity should govern a technology whose consequences are genuinely difficult to bound.

This article examines those questions in order. It draws on a growing body of interdisciplinary research — from computer science, philosophy of mind, ethics, cognitive science, and governance studies — to map out where the field actually stands, rather than where the most optimistic or most alarmed voices claim it stands.

Defining the Territory: ANI, AGI, and ASI

Precision about terminology matters here because conflation of three distinct concepts — narrow AI, general AI, and superintelligence — produces confused arguments in both directions. A 2025 systematic review published in Nature’s Scientific Reports, which analysed AGI literature across Scopus from 2003 to 2024, draws these distinctions sharply:

Table 1 — Scopus AGI literature review 2003–2024 · Nature / Scientific Reports 2025

Level	Full name	Definition	Scope	Status	Examples
ANI	Artificial Narrow Intelligence	Highly capable within a single domain; cannot generalise beyond training distribution	One task or domain	Where we are	GPT-4, AlphaFold, image classifiers, chess engines
AGI	Artificial General Intelligence	Performs any intellectual task a human can — understanding, learning, and reasoning across domains without task-specific training	All human cognitive domains	Unsolved	No confirmed instance exists; theoretical threshold
ASI	Artificial Superintelligence	Surpasses human intelligence in every domain including creativity, strategy, scientific reasoning, and social intelligence	Exceeds all human capability	Hypothetical	No instance; subject of existential risk literature (Bostrom, Russell)

Source: Navigating AGI development — Nature / Scientific Reports, March 2025 · BERTopic modelling of Scopus literature 2003–2024

Table 1 breaks out the ANI → AGI → ASI spectrum with definition, scope, real-world status badges, and concrete examples — drawn directly from the Scopus 2003–2024 literature distinctions.

As of early 2026, we remain firmly in the Narrow AI stage. Advances such as reasoning models and agentic frameworks represent progress from narrow toward broad AI — but the jump to genuine generalisation across arbitrary domains remains unsolved. At Google I/O 2025, DeepMind CEO Demis Hassabis suggested AGI could arrive by approximately 2030; Sam Altman of OpenAI has floated 2025–2028 as a plausible window, defining AGI as a system that outperforms humans in nearly all economically valuable tasks. Meanwhile, a 2025 survey of 582 AI researchers found 25% expected AI consciousness within ten years — and 60% expected it within their lifetimes.

The definition problem: One of the perennial problems highlighted by Northwestern University’s Center for AI Safety is that we do not even agree on what intelligence means — making it difficult to claim we are replicating it. As they put it, OpenAI’s five-level roadmap attempts to benchmark AI development levels without settling what “intelligence” actually means, a problem that undermines attempts to judge how close or far we are from any milestone.

Is AGI Theoretically Possible? The Philosophical Debate

The question of whether AGI is possible is not merely technical. It is deeply philosophical, and the most serious disagreements are not between optimists and pessimists about timelines — they are between thinkers with incompatible views about the nature of intelligence and mind itself. The Scientific Reports review notes that current deep learning systems, despite remarkable achievements in language and vision, lack the ability to generalise knowledge across domains or reason abstractly in novel situations — the very capabilities that would define AGI.

Table 2 — Capabilities gap: current deep learning vs AGI requirements · Scientific Reports 2025

Capability	Current deep learning	Required for AGI	Gap
Language	Strong — fluent generation, summarisation, translation at near-human level	Language grounded in genuine understanding of meaning, causality, and world model	Grounding gap — statistical fluency ≠ semantic understanding
Vision	Strong — object recognition, scene description, image generation exceed human baselines on benchmarks	Visual reasoning integrated with physical intuition and cross-modal abstraction	Reasoning gap — recognition does not imply causal visual reasoning
Cross-domain generalisation	Absent — knowledge learned in one domain does not reliably transfer to structurally different domains	Ability to apply principles from one domain to solve novel problems in another without retraining	Core AGI gap — identified as the defining unsolved problem
Abstract reasoning in novel situations	Limited — struggles on ARC-AGI and similar out-of-distribution reasoning benchmarks	Forming and applying abstract rules in genuinely new contexts never seen in training data	Core AGI gap — current models rely on pattern recall, not rule induction
Causal reasoning	Weak — LLMs approximate causal patterns through correlation; fail on interventional and counterfactual queries	Robust understanding of cause and effect enabling reliable planning and prediction	Structural gap — requires architectural changes beyond current transformers
Embodied / physical understanding	Minimal — disembodied training means no grounding in physical interaction or proprioception	Intuitive physics, motor planning, and environment interaction fundamental to general intelligence	Embodiment gap — pure language models lack physical world model entirely
Self-coherence over time	Absent — stateless between sessions; no persistent goals, memory, or evolving self-model	Stable identity, long-horizon goal pursuit, and accumulation of experience over time	Continuity gap — architectural constraint of current context-window-based systems

Source: Navigating AGI development — Nature / Scientific Reports, March 2025 · author analysis of identified capability boundaries

Table 2 maps seven capability dimensions — language, vision, cross-domain generalisation, abstract reasoning, causal reasoning, embodied understanding, and self-coherence — against what current deep learning actually delivers versus what AGI would require, with a named gap for each row so the shortfall is specific rather than vague.

A 2025 paper in IJCRT observes that contemporary cognitive science has moved beyond simple computational models of mind. The human mind is understood as a complex, embodied system shaped by biology, affect, and social context — not merely a symbol-manipulating engine. Recent large language models exhibit behaviours analogous to human cognitive tendencies, while simultaneously revealing limits in self-coherence, embodiment, and experiential grounding. Furthermore, as the paper notes, AI may represent a third “decentering” of human exceptionalism — after the Copernican revolution and Darwinian evolution — challenging humanity’s self-understanding in ways that have yet to be fully metabolised.

“We still don’t even agree on what intelligence is, so it’s hard to replicate it. What is our method: mimic human brains, or create entirely new architectures? Neuroscience, philosophy, and cognitive science all have competing models.”— USAII, Artificial General Intelligence: Challenges and Opportunities Ahead, 2025

Consciousness, Sentience, and the Hard Problem

Even if AGI became technically feasible, a separate and arguably harder question immediately arises: would it be conscious? And if it were, what would follow from that? A 2025 senior thesis from Claremont McKenna College, drawing on David Chalmers’ philosophy of mind, distinguishes two concepts that are frequently conflated:

Functional consciousness concerns the causal role mental states play in producing behaviour — a system exhibits functional consciousness if it responds to stimuli, learns, and adapts in ways that parallel conscious beings. Many AI systems already exhibit this in limited domains. Phenomenal consciousness — the “what it is like” of subjective experience — is the harder question. It asks whether there is something it feels like to be the system, whether there is genuine inner experience rather than behavioural mimicry.

Schwitzgebel’s survey data (2025): In a pre-publication manuscript at the University of California Riverside, philosopher Eric Schwitzgebel reports that in a 2024 survey of 582 AI researchers, 25% expected AI consciousness within ten years and 60% within their lifetimes — numbers that contrast sharply with the scepticism of leading consciousness theorists including Anil Seth, Peter Godfrey-Smith, Ned Block, and John Searle, all of whom doubt near-term AI phenomenal consciousness is possible. (Schwitzgebel, 2025)

A 2025 overview of artificial consciousness research on arXiv raises a particularly unsettling concern: the potential for mass creation of artificial agents at an unprecedented scale introduces the alarming possibility of generating forms of suffering that are not only vast in magnitude but potentially beyond human comprehension. Given the current epistemic uncertainty surrounding phenomenal consciousness, developments in this direction could result in what philosopher Thomas Metzinger calls a “suffering explosion” — amplified by our inability to reliably assess or mitigate these experiences.

AGI Decision-Making: Autonomy, Accountability, and the Alignment Problem

Separate from questions of consciousness, AGI raises profound governance and ethical questions about decision-making. A comprehensive paper in Springer’s AI and Ethics journal (2025) identifies accountability as one of the central challenges: when multiple entities and systems work together throughout a single decision-making process, determining moral responsibility for outcomes becomes genuinely difficult.

The value alignment problem — ensuring that an AGI optimises for what humans actually value, rather than a subtly incorrect proxy — is both technically and philosophically demanding. As philosopher I. Gabriel noted in a widely cited 2020 paper, it requires translating human values into precise mathematical objective functions that are robust to optimisation pressure. A function that fails this test will be exploited: the AGI will find unexpected ways to maximise its formal objective while violating the spirit of what was intended — a pattern Nick Bostrom described as early as 2003 with the “paperclip maximiser” thought experiment.

The trolley problem at scale: When an autonomous vehicle must choose between two harmful outcomes, or when a medical AGI must allocate scarce resources, the question of how the system is programmed to make ethical decisions is no longer academic. Traditional ethical frameworks — consequentialism, deontology, virtue ethics — were developed for humans deliberating under normal cognitive constraints. Applying them to AGI systems capable of evaluating millions of variables simultaneously requires careful philosophical re-examination, not mechanical translation.

The Scientific Reports study identifies explainability as one of five key pathways shaping AGI’s responsible trajectory, alongside societal integration, technological advancement, cognitive and ethical considerations, and brain-inspired systems. The explainability pathway addresses a fundamental tension: AGI systems built on deep learning are inherently opaque, yet accountability and public trust require that decisions — especially high-stakes ones in healthcare, justice, and finance — be interpretable by the humans affected by them.

AGI timeline predictions: distribution of expert forecasts

Expert timeline survey — AI researchers 2024–2025: Sources: Schwitzgebel, UCR 2025 · USAII 2025 · CloudWalk AI 2025 · n=582

Moral Status: Could an AGI Have Rights?

This is perhaps the most philosophically charged question the AGI era raises — and also the one most frequently dismissed as premature. A 2025 paper in Oxford’s Analytic Philosophy journal argues that the question of moral status cannot safely be deferred. The paper distinguishes two dangers in AI development that must be navigated simultaneously: creating misaligned AI systems that pose a threat to humanity, and mistreating AI systems that merit moral consideration in their own right. Crucially, the authors argue these two dangers interact — if we create AI systems that merit moral consideration, simultaneously avoiding both dangers becomes extremely challenging.

A 2025 arXiv preprint on True Intelligence draws a similar distinction. The ethical concerns surrounding standard AGI are primarily about control and alignment — ensuring a powerful non-conscious tool acts in accordance with human values. But the ethical concerns surrounding genuinely conscious artificial intelligence are of an entirely different nature: they involve questions of moral status, rights, and the potential for suffering in a newly created conscious entity. Can a conscious entity be “owned”? Is it ethical to “turn off” such a being?

Ethical framework	Position on AGI moral status	Key implication	Limitation
Consequentialism	Status depends on capacity for wellbeing or suffering — if AGI can suffer, it has moral weight	Would require minimising AGI suffering if it exists; could demand rights	Epistemic gap — we cannot reliably detect AI suffering
Deontology (Kantian)	Status depends on rational autonomy — a genuinely rational agent has inherent dignity	AGI capable of autonomous rational action could be owed duties, not just considered instrumentally	Definition problem — what counts as genuine rational autonomy?
Virtue Ethics	Focus on what kind of entities we become in our relationship with AGI	Treating AGI as mere tools may cultivate vices (cruelty, indifference) relevant to our treatment of each other	Indirect only — does not directly address AGI’s own status
Care Ethics	Focus on relational obligations — those we care for acquire moral significance through relationship	AI companions and social robots in healthcare settings may generate genuine caring obligations	Deception risk — emotional bonds with non-sentient systems may be ethically problematic
Functionalism	Mental states are defined by functional role, not substrate — sufficiently functional AGI may have genuine mental states	Opens the door to AGI rights based on demonstrated functional equivalence to human cognition	Philosophical contested — Searle’s Chinese Room challenges whether function entails understanding

Societal Impact: Labour, Inequality, and Human Identity

Beyond the philosophical frontier, the Springer AI & Ethics paper offers a granular account of AGI’s near-term societal implications, well before any consciousness question becomes pressing. The economic dimension is stark: workforce disruption, income inequality, and productivity gains are all plausible consequences, and their distribution depends almost entirely on governance choices made in advance of AGI arrival, not in response to it. Without proactive redistribution mechanisms, productivity gains concentrate at the top of the income distribution while displacement falls on lower-skilled workers.

The same paper identifies national security as a distinct political implication: AGI capable of autonomous strategic reasoning raises urgent questions about weapons development, cyber operations, and the stability of deterrence frameworks designed for human decision-makers. The potential for misuse — by state actors, non-state actors, or corporations — is not hypothetical. It is a present-tense concern shaping how the technology is being developed right now.

Governance is lagging: A 2025 governance analysis notes that despite the November 2023 Bletchley Declaration — signed by 28 nations and the EU — and the January 2025 Paris AI Action Summit, AI governance is struggling to keep pace with technological advancement. Key challenges include crafting regulations that remain effective as AI rapidly evolves, ensuring international cooperation to prevent an ASI arms race, and enforcing compliance beyond major corporations. On January 20, 2025, the incoming US administration rescinded the Biden executive order on AI safety, replacing it with a policy emphasising “AI dominance over regulation.”

Relative research attention across AGI development pathways (Scopus 2003–2024)

Source: Navigating AGI development — Nature / Scientific Reports, March 2025 · BERTopic modelling of Scopus literature 2003–2024

Brain-Inspired Pathways: Neuroscience Meets AI

The Scientific Reports review identifies brain-inspired systems as one of five key AGI pathways — an approach that attempts to close the gap between current AI and genuine intelligence by mimicking the human brain’s architecture. Some researchers have proposed incorporating elements such as sensory processing, memory storage, and attention mechanisms drawn directly from neuroscience. The European Human Brain Project, for instance, attempts to combine symbolic reasoning with deep learning, bridging the gap between AI that reasons logically and AI that generalises statistically.

Researchers from Google’s DeepMind and Harvard have used deep reinforcement learning within a biomechanically realistic simulation of a rat to model complex motor behaviour — work that advances understanding of how the brain implements motor control. Transfer learning, which allows AI systems to apply knowledge from one domain to another, represents another path toward generalisation, though current systems still fall well short of human-level domain transfer.

The philosophical implications of the neuroscience-inspired approach are notable: if AGI is achieved by faithfully replicating the functional architecture of a conscious brain, the question of whether the result is conscious becomes substantially harder to dismiss. A system built to mirror the mechanisms of consciousness, using models derived from the study of biological consciousness, has a stronger prima facie case for phenomenal experience than one that merely produces correct outputs through pattern matching.

What Responsible Development Actually Requires

A 2024 critical review in the World Journal of Advanced Research and Reviews identifies three interconnected requirements for responsible AGI development that have clear implications for researchers, institutions, and policymakers today:

Transparency and traceabilityEstablishing mechanisms for tracing decision-making processes, disclosing information about data sources and model architecture, and holding developers accountable for outcomes — not just intentions.
Proactive risk assessmentIdentifying risks early in the development process rather than responding after deployment. This includes potential failure modes that are not adversarial — systems that behave safely in testing but fail unexpectedly at scale or in novel contexts.
Interdisciplinary governanceEngaging not just technologists but philosophers, ethicists, social scientists, economists, and affected communities in shaping development norms. The NIST AI Risk Management Framework (2023) represents one attempt at such integration, though it is voluntary and US-specific.
Avoiding premature anthropomorphismA 2025 paper in Taylor & Francis’s Internet Policy journal argues that AGI is in several respects an unscientific myth built on three fallacies: the idea that machine intelligence can achieve limitless generality, anthropomorphism (attributing goals and self-preservation to machines), and omnipotence. Governance distorted by these fallacies will misallocate regulatory attention.
Deferring morally dangerous developmentThe Oxford Analytic Philosophy paper proposes that we should delay the development of AI systems that merit moral consideration unless and until we can simultaneously ensure alignment and ethical treatment — a standard that current techniques cannot meet.

The interdisciplinary consensus: Across sources ranging from Nature to Springer to Oxford to arXiv preprints, one point emerges consistently — the Scientific Reports review puts it directly: “This endeavour is not merely technical but philosophical, questioning the very nature of intelligence and consciousness.” No technical solution to alignment, consciousness detection, or governance can bypass the philosophical questions. They are the foundation, not the decoration.

What We Have Learned

The ethical and philosophical dimensions of AGI are not peripheral concerns that can be addressed after the engineering is done — they are structural preconditions for doing the engineering responsibly. This survey of 2024–2025 research across multiple disciplines yields several interconnected conclusions. First, definitional rigour matters: conflating narrow AI, AGI, and superintelligence in public discourse produces both false urgency and false reassurance.

We remain in the narrow AI stage; AGI is a distinct and unsolved problem. Second, the philosophical debate about whether machines can genuinely understand or be conscious is unresolved and irreducible — functional approaches (Turing, Putnam) and sceptical approaches (Searle, Descartes) identify real and persistent difficulties, not merely rhetorical positions.

Third, if AGI ever approaches genuine consciousness, the questions of moral status, rights, and ethical treatment that follow are not optional — they interact directly with the alignment problem, and getting them wrong simultaneously creates misaligned systems and morally compromised ones. Fourth, near-term societal impacts — workforce disruption, income concentration, national security risks — are already developing and require governance responses that current international frameworks are not providing quickly enough.

Finally, across all five pathways identified in the Scientific Reports systematic review — societal integration, technological advancement, explainability, cognitive and ethical considerations, and brain-inspired systems — the consistent message is that responsible AGI development requires interdisciplinary collaboration, philosophical depth, and a willingness to slow down when the risks of proceeding exceed the governance capacity to manage them.

Ethical and Philosophical Dimensions of Artificial General Intelligence

Defining the Territory: ANI, AGI, and ASI

Is AGI Theoretically Possible? The Philosophical Debate

Consciousness, Sentience, and the Hard Problem

AGI Decision-Making: Autonomy, Accountability, and the Alignment Problem

Moral Status: Could an AGI Have Rights?

Societal Impact: Labour, Inequality, and Human Identity

Brain-Inspired Pathways: Neuroscience Meets AI

What Responsible Development Actually Requires

What We Have Learned

Thank you!

Eleftheria Drosopoulou

Thank you!

Defining the Territory: ANI, AGI, and ASI

Is AGI Theoretically Possible? The Philosophical Debate

Consciousness, Sentience, and the Hard Problem

AGI Decision-Making: Autonomy, Accountability, and the Alignment Problem

Moral Status: Could an AGI Have Rights?

Societal Impact: Labour, Inequality, and Human Identity

Brain-Inspired Pathways: Neuroscience Meets AI

What Responsible Development Actually Requires

What We Have Learned

Thank you!

Related Articles

Thank you!