SShortSingh.
Back to feed

Study Explores Whether Large Language Models Can Pass the Mirror Self-Awareness Test

0
·1 views

A recent blog post by Pascal Schuster investigates whether large language models (LLMs) can pass the mirror test, a classic measure of self-awareness used in animal cognition research. The mirror test traditionally determines if a creature can recognize its own reflection, serving as a proxy for self-awareness. The article applies this concept to AI language models to probe whether they exhibit any form of self-recognition or awareness. The post has drawn early attention on Hacker News, sparking discussion about the nature of machine cognition. The question touches on broader debates in AI research regarding consciousness and self-modeling in neural language systems.

Read the full story at Hacker News

This is an AI-generated summary. ShortSingh links to the original source for the complete article.

Discussion (0)

Log in to join the discussion and vote.

Log in

Related stories

0
ProgrammingDEV Community ·

Developer shares 7 reusable AI prompts that cut weekly coding time by 10 hours

A software developer on DEV Community documented every AI prompt used over 30 days and identified seven that proved consistently valuable across their workflow. The prompts are structured with fillable bracketed placeholders and cover common developer tasks including code explanation, debugging, unit test writing, and refactoring. Additional prompts address self-review of code diffs before pull requests, auto-generating concise documentation, and writing conventional commit messages. Each prompt includes specific constraints — such as ranking bug causes by probability or capping documentation at 150 words — designed to produce focused, actionable output rather than generic responses. The developer claims the combined use of these prompts saves over 10 hours per week by reducing time spent on repetitive or context-heavy coding tasks.

0
ProgrammingDEV Community ·

WHOIS Protocol Failing Silently on Up to 60% of Lookups as RDAP Takes Over

The decades-old WHOIS protocol, which relies on unstructured plain-text responses over TCP port 43, has been deteriorating for years and is now failing on an estimated 30–60% of domain lookups. ICANN's RDAP compliance mandate, enforced from August 2024, effectively broke legacy WHOIS endpoints for gTLD registrars still using them. Simultaneously, major data providers like WhoisXML API eliminated free tiers and raised pricing minimums, pushing teams toward paid alternatives. RDAP, standardized via RFC 7480–7484, offers a structured JSON-based replacement that addresses WHOIS's lack of schema, encoding standards, and error signaling. In response, developers have built RDAP-first tools — such as the Apify actor described here — that fall back to legacy WHOIS only for unmigrated TLDs and return a unified JSON schema across all domains.

0
ProgrammingDEV Community ·

Developer Builds Golang Security System Designed to Destroy Itself If Compromised

A developer has published an open-source project called TITAN NEXUS, arguing that standard end-to-end encryption leaves data vulnerable once it reaches cloud provider infrastructure, where hypervisors can access RAM. The project, built in Golang, attempts to address this by pinning cryptographic keys in isolated, non-pageable memory and limiting their lifespan to fractions of a millisecond. A core feature is a 'cryptographic suicide' mechanism that actively zeroes out and poisons the system's memory state if it detects a RAM snapshot, hibernation event, or unprivileged interrupt. The developer contends that true zero-trust security requires treating cloud infrastructure itself as a potential adversary, going beyond existing approaches like Confidential Computing. The project's code has been shared on GitHub, with the author inviting feedback from security researchers and cloud architects.

0
ProgrammingDEV Community ·

Developer Builds CIS-Hardened CentOS 9 Golden Images with Packer and QEMU on WSL2

A software developer has published a method for building production-grade CentOS 9 Stream golden images entirely on a Windows laptop using WSL2 with nested KVM, eliminating the need for a cloud builder or dedicated Linux machine. The pipeline uses Packer and QEMU to produce a QCOW2 image that is CIS Level 1 hardened via the ansible-lockdown role and pre-loaded with PingAccess 8.3.5 on a JRE 17 runtime. The build process is split into two sequential Packer stages — one for base OS hardening and one for application layering — each driven by a single shell script with targeted build options. A signed Software Bill of Materials (SBOM) and VEX attestations are generated so that Trivy vulnerability scans can distinguish actually exploitable issues from theoretical ones. The guide also documents WSL2-specific pitfalls, such as a CIS sudoers rule that deadlocks Ansible pipelining and a noexec-mounted /tmp partition that breaks PingAccess installation assumptions.

Study Explores Whether Large Language Models Can Pass the Mirror Self-Awareness Test · ShortSingh