Amb and Search
Search and amb
- SICP Comparison Edition" />
4.3.1
<span style="color:green">Amb and Search</span>
<span style="color:blue">Search and amb</span>
- SICP Comparison Edition
To extend
SchemeJavaScript
to support nondeterminism, we introduce a new
special formsyntactic form
called amb.[1]
The expression
(amb $e_1\ e_2\ldots e_n$)amb($e_1,\ e_2,\ldots, e_n$)
returns the value of one of the $n$ expressions
$e_i$ ambiguously. For example,
the expression
Amb
An amb expression
with a single choice produces an ordinary (single) value.
Amb
An amb expression
with no choices—the expression
(amb)—isamb()—is
an expression with no acceptable values. Operationally, we can think of
(amb)amb()
as an expression that when evaluated causes the computation to
fail: The computation aborts and no value is produced.
Using this idea, we can express the requirement that a particular predicate
expression p must be true as follows:
Original
JavaScript
(define (require p)
(if (not p) (amb)))
function require(p) {
if (! p) {
amb();
} else {}
}
With amb and
require, we can implement the
an-element-of procedurean_element_of function
used above:
function an_element_of(items) {
require(! is_null(items));
return amb(head(items), an_element_of(tail(items)));
}
An-element-of
An application of an_element_of
fails if the list is empty. Otherwise it ambiguously returns either the
first element of the list or an element chosen from the rest of the list.
We can also express infinite ranges of choices. The following
procedurefunction
potentially returns any integer greater than or equal to some
given $n$:
Original
JavaScript
(define (an-integer-starting-from n)
(amb n (an-integer-starting-from (+ n 1))))
function an_integer_starting_from(n) {
return amb(n, an_integer_starting_from(n + 1));
}
This is like the stream
procedure
integers-starting-fromfunction
integers_starting_from
described in section 3.5.2, but with an
important difference: The stream
procedurefunction
returns an object that represents the sequence of all integers beginning
with $n$, whereas the
ambprocedurefunction
returns a single integer.[2]
Abstractly, we can imagine that evaluating an
amb expression causes
time to split into
branches, where the computation continues on each branch with one of the
possible values of the expression. We say that
amb represents a
nondeterministic choice point. If we had a machine with a
sufficient number of processors that could be dynamically allocated, we
could implement the search in a straightforward way. Execution would
proceed as in a sequential machine, until an amb
expression is encountered. At this point, more processors would be allocated
and initialized to continue all of the parallel executions implied by the
choice. Each processor would proceed sequentially as if it were the only
choice, until it either terminates by encountering a failure, or it further
subdivides, or it finishes.[3]
On the other hand, if we have a machine that can execute only one process
(or a few concurrent processes), we must consider the alternatives
sequentially. One could imagine modifying an evaluator to pick at random a
branch to follow whenever it encounters a choice point. Random choice,
however, can easily lead to failing values. We might try running the
evaluator over and over, making random choices and hoping to find a
non-failing value, but it is better to
systematically search all possible execution paths. The
amb evaluator that we will develop and work
with in this section implements a systematic search as follows: When the
evaluator encounters an application of amb, it
initially selects the first alternative. This selection may itself lead to
a further choice. The evaluator will always initially choose the first
alternative at each choice point. If a choice results in a failure, then
the evaluator
automagically[4]backtracks to the most recent choice point and tries the next
alternative. If it runs out of alternatives at any choice point, the
evaluator will back up to the previous choice point and resume from there.
This process leads to a search strategy known as
depth-first search or
chronological
backtracking.[5]
Driver loop
The
driver loop for the amb evaluator has some
unusual properties. It reads
an expressiona program
and prints the value of the
first non-failing execution, as in the
prime-sum-pairprime_sum_pair
example shown above. If we want to see the value of the next successful
execution, we can ask the interpreter to backtrack and attempt to generate a
second non-failing execution.
Original
JavaScript
This is signaled by typing the symbol
try-again. If any expression except
try-again
is given, the interpreter will start a new problem, discarding the
unexplored alternatives in the previous problem.
This is signaled by typing
retry.
If any other input except retry
is given, the interpreter will start a new problem, discarding the
unexplored alternatives in the previous problem.
Here is a sample interaction:
Original
JavaScript
;;; Amb-Eval input:
(prime-sum-pair '(1 3 5 8) '(20 35 110))
;;; Starting a new problem
;;; Amb-Eval value:
(3 20)
amb-evaluate input:
prime_sum_pair(list(1, 3, 5, 8), list(20, 35, 110));
Starting a new problem
amb-evaluate value:
[3, [20, null]]
try-again
;;; There are no more values of
(prime-sum-pair (quote (1 3 5 8)) (quote (20 35 110)))
amb-evaluate input:
retry
There are no more values of
prime_sum_pair([1, [3, [5, [8, null]]]], [20, [35, [110, null]]])
Original
JavaScript
;;; Amb-Eval input:
(prime-sum-pair '(19 27 30) '(11 36 58))
;;; Starting a new problem
;;; Amb-Eval value:
(30 11)
amb-evaluate input:
prime_sum_pair(list(19, 27, 30), list(11, 36, 58));
Starting a new problem
amb-evaluate value:
[30, [11, null]]
Exercise 4.44
Write a
procedure
an-integer-betweenfunction
an_integer_between
that returns an integer between two given bounds. This can be used to
implement a
procedurefunction
that finds
Pythagorean triples, i.e., triples of integers
$(i,j,k)$ between the given bounds such
that $i \leq j$ and
$i^2 + j^2 =k^2$, as follows:
Original
JavaScript
(define (a-pythagorean-triple-between low high)
(let ((i (an-integer-between low high)))
(let ((j (an-integer-between i high)))
(let ((k (an-integer-between j high)))
(require (= (+ (* i i) (* j j)) (* k k)))
(list i j k)))))
function a_pythogorean_triple_between(low, high) {
const i = an_integer_between(low, high);
const j = an_integer_between(i, high);
const k = an_integer_between(j, high);
require(i * i + j * j === k * k);
return list(i, j, k);
}
There is currently no solution available for this exercise. This textbook adaptation is a community effort. Do consider contributing by providing a solution for this exercise, using a Pull Request in Github.
Exercise 4.45
Exercise 3.72 discussed how to
generate the stream of all
Pythagorean triples, with no upper bound
on the size of the integers to be searched. Explain why simply replacing
an-integer-betweenan_integer_between
by
an-integer-starting-froman_integer_starting_from
in the
procedurefunction
in
exercise 4.44 is not an adequate way to
generate arbitrary Pythagorean triples. Write a
procedurefunction
that actually will accomplish this. (That is, write a
procedurefunction
for which repeatedly typing
try-againretry
would in principle eventually generate all Pythagorean triples.)
There is currently no solution available for this exercise. This textbook adaptation is a community effort. Do consider contributing by providing a solution for this exercise, using a Pull Request in Github.
Exercise 4.46
Ben Bitdiddle claims that the following method for generating
Pythagorean
triples is more efficient than the one in
exercise 4.44. Is he correct?
(Hint: Consider the number of possibilities that must be explored.)
function a_pythagorean_triple_between(low, high) {
const i = an_integer_between(low, high);
const hsq = high * high;
const j = an_integer_between(i, high);
const ksq = i * i + j * j;
require(hsq >= ksq);
const k = math_sqrt(ksq);
require(is_integer(k));
return list(i, j, k);
}
There is currently no solution available for this exercise. This textbook adaptation is a community effort. Do consider contributing by providing a solution for this exercise, using a Pull Request in Github.
[1]
The idea of
amb for nondeterministic programming was
first described in 1961 by
John McCarthy (see
McCarthy 1967).
[2]
In actuality, the distinction between
nondeterministically returning a single choice and returning all choices
depends somewhat on our point of view. From the perspective of the code
that uses the value, the nondeterministic choice returns a single value.
From the perspective of the programmer designing the code, the
nondeterministic choice potentially returns all possible values, and the
computation branches so that each value is investigated
separately.
[3]
One might object that this is a
hopelessly inefficient mechanism. It might require millions of processors
to solve some easily stated problem this way, and most of the time most
of those processors would be idle. This objection should be taken in
the context of history. Memory used to be considered just such an
expensive commodity.
In 1965 a megabyte of RAM cost about $400,000. Now every personal
computer has many gigabytes of RAM, and most of the time most of that RAM is
unused. It is hard to underestimate the cost of mass-produced
electronics.
[4]
Automagically: Automatically, but in a way
which, for some reason (typically because it is too complicated, or too ugly,
or perhaps even too trivial), the speaker doesn't feel like
explaining.
(Steele 1983,
Raymond 1996)
[5]
The integration of
automatic search strategies
into programming languages has had a long and checkered history. The first
suggestions that nondeterministic algorithms might be elegantly encoded in a
programming language with search and automatic backtracking came from
Robert Floyd (1967).
Carl Hewitt (1969) invented a programming language called
Planner that explicitly supported automatic chronological backtracking,
providing for a built-in depth-first search strategy.
Sussman, Winograd, and Charniak (1971) implemented a subset of this language,
called
MicroPlanner, which was used to support work in problem solving and robot
planning. Similar ideas, arising from logic and theorem proving, led to the
genesis in Edinburgh and Marseille of the elegant language
Prolog (which we will discuss in
section 4.4). After sufficient
frustration with automatic search,
McDermott and Sussman (1972) developed a language called
Conniver, which included mechanisms for placing the search strategy under
programmer control. This proved unwieldy, however, and
Sussman and Stallman (1975) found a more tractable approach while
investigating methods of symbolic analysis for electrical circuits. They
developed a nonchronological backtracking scheme that was based on tracing
out the logical dependencies connecting facts, a technique that has come to
be known as
dependency-directed backtracking. Although their method was
complex, it produced reasonably efficient programs because it did little
redundant search.
Doyle (1979) and
McAllester (1978, 1980)
generalized and clarified the methods of Stallman and Sussman, developing a
new paradigm for formulating search that is now called
truth maintenance.
Modern problem-solving systems all
Many problem-solving systems
use some form of truth-maintenance system as a substrate. See
Forbus and de Kleer 1993 for a discussion of elegant
ways to build truth-maintenance systems and applications using truth
maintenance.
Zabih, McAllester, and Chapman 1987 describes a
nondeterministic extension to Scheme that is based on
amb; it is similar to the interpreter described
in this section, but more sophisticated, because it uses dependency-directed
backtracking rather than chronological
backtracking.
Winston 1992 gives an introduction to
both kinds of backtracking.