Turing Test Proposals

Are we making a joint effort at this? Are we making an ETARC submission, or are we just sending in our own ideas individually?

Because if we’re going to make a collective test paper, we need to start collating ideas, and voting on things.

And we need more ideas. Original ones.

I sent in 16 questions. :wink:

As the ETARC CSD, one of us could send in a joint-effort test. Some have already sent in their own. It would be cool if ETARC's test were chosen.

The above list of questions was a start in that direction.

I’m more interested in seeing if the potential AI is like Skynet…

Apart from one, the questions I submitted were supposed to work in couplets: one question most humans would easily pass, paired with another, very similar one that was more morally ambiguous.

In submitting the questions to a computer, you've removed half of them.

But a crucial part of a successful Turing test is not just trying to identify the computer. Half of the task is to identify the humans. Once you're fairly sure who the humans are, whoever is left is probably the computer.

@Polyphemus I understood the dual-question format. Mitsuku was able to answer the ones I left out reasonably well.

How many questions should there be in the test? Is there a rule or format for that?

Suggestions have been between 5 and 10, but some have sent in more.

There is a long-running bet on a computer passing the Turing test: http://www.kurzweilai.net/a-wager-on-the-turing-test-the-rules. Under that protocol, the examination takes two hours, with no limit on the number of questions in that period.

WT have given no indication of what they consider a reasonable number of questions.

@Polyphemus For the mentally challenged (i.e. me): if we were actually testing an AI with a Turing test, would it essentially come down to Ockham's Razor?

Ockham's Razor is essentially the principle of the "line of least resistance": "Entities should not be multiplied unnecessarily" - i.e. the answer that requires the fewest assumptions is probably the correct one.

But in this case, we’re dealing with some very subtle distinctions. Facts and logic are things computers are very good at.

Emotions and human dilemmas, however, are not necessarily logical or factual. And I think that’s the way to go at this point. Consider how an average group of people would feel about a situation. Not clever people, or well-read people. Just ordinary, quirky, idiosyncratic people.

I agree. By Ockham's Razor I kind of meant what I was seeing Mitsuku do. She mostly didn't answer the second part of a two-part question, or side-stepped or ignored "emotion" questions. Compared with a human's answers, the way she handled "emotion" questions and two-part or dual questions would make it possible to tell the AI and the human apart.

I agree, an ETARC test sounds cool. I won't send in a personal one (too tired, 2 AM here), and I think we can use some from the list.

I’ll think up a few more questions.

You can run them by Mitsuku at http://www.mitsuku.com/

I have been chatting with Mitsuku. I find that if I "converse" under the assumption that I'm talking to a real person, and don't try to "trip up" the AI, it becomes much easier to spot. It handles single questions quite well, but small talk soon fails.

If the AI we are testing is Loop16 (Emily), that line gets blurred. She was pretty convincing.

@Polyphemus We are running out of time.

I think it's too late. Correct me if I'm wrong, but it said 20:00, and I think it's past 20:00 EST? Wish we could have done it, though; it was a cool idea.

20:30, according to the Reddit post.