数据结构与算法分析c++版答案.pdf
Data Structures and Algorithm 习题答案 Preface ii 1 Data Structures and Algorithms 1 2 Mathematical Preliminaries 5 3 Algorithm Analysis 17 4 Lists,Stacks,and Queues 23 5 Binary Trees 32 6 General Trees 40 7 Internal Sorting 46 8 File Processing and External Sorting 54 9Searching 58 10 Indexing 64 11 Graphs 69 12 Lists and Arrays Revisited 76 13 Advanced Tree Structures 82 i ii Contents 14 Analysis Techniques 88 15 Limits to Computation 94 Preface Contained herein are the solutions to all exercises from the textbook A Practical Introduction to Data Structures and Algorithm Analysis,2nd edition.For most of the problems requiring an algorithm I have given actual code.In a few cases I have presented pseudocode.Please be aware that the code presented in this manual has not actually been compiled and tested.While I believe the algorithms to be essentially correct,there may be errors in syntax as well as semantics.Most importantly,these solutions provide a guide to the instructor as to the intended answer,rather than usable programs.1 Data Structures and Algorithms Instructors note:Unlike the other chapters,many of the questions in this chapter are not really suitable for graded work.The questions are mainly intended to get students thinking about data structures issues.This question does not have a specific right answer,provided the student keeps to the spirit of the question.Students may have trouble with the concept of“operations.”This exercise asks the student to expand on their concept of an integer representation.A good answer is described by Project,where a singly-linked list is suggested.The most straightforward implementation stores each digit in its own list node,with digits stored in reverse order.Addition and multiplication are implemented by what amounts to grade-school arithmetic.For addition,simply march down in parallel through the two lists representing the operands,at each digit appending to a new list the appropriate partial sum and bringing forward a carry bit as necessary.For multiplication,combine the addition function with a new function that multiplies a single digit by an integer.Exponentiation can be done either by repeated multiplication (not really practical)or by the traditional(log n)-time algorithm based on the binary representation of the exponent.Discovering this faster algorithm will be beyond the reach of most students,so should not be required.A sample ADT for character strings might look as follows(with the normal interpretation of the function names assumed).Chap.1 Data Structures and Algorithms Some In C+,this is 1 for s1s2;0 for s1=s2;int strcmp(String s1,String s2)Ones compliment stores the binary representation of positive numbers,and stores the binary representation of a negative number with the bits inverted.Twos compliment is the same,except that a negative number has its bits inverted and then one is added(for reasons of efficiency in hardware implementation).This representation is the physical implementation of an ADT defined by the normal arithmetic operations,declarations,and other support given by the programming language for integers.An ADT for two-dimensional arrays might look as follows.Matrix add(Matrix M1,Matrix M2);Matrix multiply(Matrix M1,Matrix M2);Matrix transpose(Matrix M1);void setvalue(Matrix M1,int row,int col,int val);int getvalue(Matrix M1,int row,int col);List getrow(Matrix M1,int row);One implementation for the sparse matrix is described in Section Another implementation is a hash table whose search key is a concatenation of the matrix coordinates.Every problem certainly does not have an algorithm.As discussed in Chapter 15,there are a number of reasons why this might be the case.Some problems dont have a sufficiently clear definition.Some problems,such as the halting problem,are non-computable.For some problems,such as one typically studied by artificial intelligence researchers,we simply dont know a solution.We must assume that by“algorithm”we mean something composed of steps are of a nature that they can be performed by a computer.If so,than any algorithm can be expressed in C+.In particular,if an algorithm can be expressed in any other computer programming language,then it can be expressed in C+,since all (sufficiently general)computer programming languages compute the same set of functions.The primitive operations are(1)adding new words to the dictionary and(2)searching the dictionary for a given word.Typically,dictionary access involves some sort of pre-processing of the word to arrive at the“root”of the word.A twenty page document(single spaced)is likely to contain about 20,000 words.A user may be willing to wait a few seconds between individual“hits”of mis-spelled words,or perhaps up to a minute for the whole document to be processed.This means that a check for an individual word can take about 10-20 ms.Users will typically insert individual words into the dictionary interactively,so this process can take a couple of seconds.Thus,search must be much more efficient than insertion.The user should be able to find a city based on a variety of attributes(name,location,perhaps characteristics such as population size).The user should also be able to insert and delete cities.These are the fundamental operations of any database system:search,insertion and deletion.A reasonable database has a time constraint that will satisfy the patience of a typical user.For an insert,delete,or exact match query,a few seconds is satisfactory.If the database is meant to support range queries and mass deletions,the entire operation may be allowed to take longer,perhaps on the order of a minute.However,the time spent to process individual cities within the range must be appropriately reduced.In practice,the data representation will need to be such that it accommodates efficient processing to meet these time constraints.In particular,it may be necessary to support operations that process range queries efficiently by processing all cities in the range as a batch,rather than as a series of operations on individual cities.Students at this level are likely already familiar with binary search.Thus,they should typically respond with sequential search and binary search.Binary search should be described as better since it typically needs to make fewer comparisons (and thus is likely to be much faster).The answer to this question is discussed in Chapter 8.Typical measures of cost will be number of comparisons and number of swaps.Tests should include running timings on sorted,reverse sorted,and random lists of various sizes.Chap.1 Data Structures and Algorithms The first part is easy with the hint,but the second part is rather difficult to do without a stack.a)bool checkstring(string S)int count=0;for(int i=0;ilength(S);i+)if(Si=()count+;if(Si=)if(count=0)return FALSE;count-;if(count=0)return TRUE;else return FALSE;b)int checkstring(String Str)Stack S;int count=0;for(int i=0;i 0.It is symmetric since xy =yx.It is transitive since any two members of the given class satisfy the relationship.5 Chap.2 Mathematical Preliminaries (d)This is not an equivalance relation since it is not symmetric.For example,a =1and b =2.(e)This is an eqivalance relation that divides the rationals based on their fractional values.It is reflexive since for all a,=0.It is symmetric since if =x then =.x.It is transitive since any two rationals with the same fractional value will yeild an integer.(f)This is not an equivalance relation since it is not transitive.For example,4.2=2and 2.0=2,but 4.0=4.A relation is a partial ordering if it is antisymmetric and transitive.(a)Not a partial ordering because it is not transitive.(b)Is a partial ordering bacause it is antisymmetric(if a is an ancestor of b,then b cannot be an ancestor of a)and transitive(since the ancestor of an ancestor is an ancestor).(c)Is a partial ordering bacause it is antisymmetric(if a is older than b,then b cannot be older than a)and transitive(since if a is older than b and b is older than c,a is older than c).(d)Not a partial ordering,since it is not antisymmetric for any pair of sisters.(e)Not a partial ordering because it is not antisymmetric.(f)This is a partial ordering.It is antisymmetric(no violations exist)and transitive(no violations exist).A total ordering can be viewed as a permuation of the elements.Since there are n!permuations of n elements,there must be n!total orderings.This proposed ADT is inspired by the list ADT of Chapter 4.void clear();void insert(int);void remove(int);void sizeof();bool isEmpty();bool isInSet(int);This proposed ADT is inspired by the list ADT of Chapter 4.Note that while it is similiar to the operations proposed for Question,the behaviour is somewhat different.void clear();void insert(int);void remove(int);void sizeof();7 bool isEmpty();long ifact(int n)The iterative version requires careful examination to understand what it does,or to have confidence that it works as claimed.(b)Fibr is so much slower than Fibi because Fibr re-computes the bulk of the series twice to get the two values to add.What is much worse,the recursive calls to compute the subexpressions also re-compute the bulk of the series,and do so recursively.The result is an exponential explosion.In contrast,Fibicomputes each value in the series exactly once,and so its running time is proportional to n.void GenTOH(int n,POLE goal,POLE t1,POLE t2,POLE*curr)if(currn=goal)Put others on t1.GenTOH(n-1,t1,goal,t2,curr);move(t2,goal);GenTOH(n-1,goal,t1,t2,curr);In theory,this series approaches,but never reaches,0,so it will go on forever.In practice,the value should become computationally indistinguishable from zero,and terminate.However,this is terrible programming practice.Chap.2 Mathematical Preliminaries void allpermute(int array,int n,int currpos)if(currpos=(n-1)printout(array);return;for(int i=currpos;in;i+)swap(array,currpos,i);allpermute(array,n,currpos+1);swap(array,currpos,i);The idea is the print out the elements at the indicated bit positions within the set.If we do this for values in the range 0 to 2n .1,we will get the entire powerset.void powerset(int n)for(int i=0;iipow(2,n);i+)for(int j=0;jn;j+)if(bitposition(n,j)=1)cout j ;cout endl;Proof:Assume that there is a largest prime number.Call it Pn,the nth largest prime number,and label all of the primes in order P1=2,P2=3,and so on.Now,consider the number C formed by multiplying all of the n prime numbers together.The value C +1is not divisible by any of the n prime numbers.C +1is a prime number larger than Pn,a contradiction.Thus,we conclude that there is no largest prime number.Note:This problem is harder than most sophomore level students can handle.Proof:The proof is by contradiction.Assume that 2is rational.By definition,there exist integers p and q such that p 2=,q where p and q have no common factors(that is,the fraction p/q is in lowest terms).By squaring both sides and doing some simple algebraic manipulation,we get 2 p 2=2 q 22 2q =p Since p2 must be even,p must be even.Thus,9 22 2q =4(p)2 22 q =2(p)2 This implies that q2 is also even.Thus,p and q are both even,which contra dicts the requirement that p and q have no common factors.Thus,2must be irrational.The leftmost summation sums the integers from 1 to n.The second summation merely reverses this order,summing the numbers from n .1+1=n down to n .n +1=1.The third summation has a variable substitution of i,with a corresponding substitution in the summation bounds.Thus,it is also the summation of n .0=n to n .(n .1)=1.Proof:(a)Base case.For n =1,12=2(1)3+3(1)2+1/6=1.Thus,the formula is correct for the base case.(b)Induction Hypothesis.2(n .1)3+3(n .1)2+(n .1)i2=.6 i=1 (c)Induction Step.i2 i2+n 2 =i=1 i=1 2(n .1)3+3(n .1)2+(n .1)2 =+n 6 2n3.6n2+6n .2+3n2.6n +3+n .1 2 =+n 6 2n3+3n2+n =.6 Thus,the theorem is proved by mathematical induction.Proof:(a)Base case.For n =1,1/2=1.1/2=1/2.Thus,the formula is correct for the base case.(b)Induction Hypothesis.11 =1.2 i=1 Chap.2 Mathematical Preliminaries (c)Induction Step.1 11 =+i in 222 i=1 i=1 11 =1.+n 221 =1.n 2 Thus,the theorem is proved by mathematical induction.Proof:(a)Base case.For n =0,20=21.1=1.Thus,the formula is correct for the base case.(b)Induction Hypothesis.2i =2n .1.i=0 (c)Induction Step.2i =2i +2n i=0 i=0 n =2n .1+2n+1.1 =2.Thus,the theorem is proved by mathematical induction.The closed form solution is 3n+,which I deduced by noting that 3F (n).2 n+1.3 F (n)=2F (n)=3.Now,to verify that this is correct,use mathematical induction as follows.For the base case,F (1)=3=.2 The induction hypothesis is that=(3n .3)/2.i=1 So,3i =3i +3n i=1 i=1 3n .3 n =+3 2 n+1.3 3 =.2 Thus,the theorem is proved by mathematical induction.11 n Theorem (2i)=n2+n.i=1 (a)Proof:We know from Example that the sum of the first n odd numbers is ith even number is simply one greater than the ith odd number.Since we are adding nsuch numbers,the sum must be n greater,or n2+n.(b)Proof:Base case:n=1yields 2=12+1,which is true.Induction Hypothesis:2i=(n.1)2+(n.1).i=1 Induction Step:The sum of the first neven numbers is simply the sum of the first n.1even numbers plus the nth even number.2i =(2i)+2n i=1 i=1 =(n.1)2+(n.1)+2n =(n 2.2n+1)+(n.1)+2n =n 2.n+2n =n 2+n.n Thus,by mathematical induction,2i=n2+n.i=1 Proof:52 Base case.For n =1,Fib(1)=1 n=2,Fib(2)=1(5).3 Thus,the formula is correct for the base case.Induction Hypothesis.For all positive integers in,5 i Fib(i)().3 Induction Step.Fib(n)=Fib(n.1)+Fib(n.2)and,by the Induction Hypothesis,Fib(n.1)(5)and Fib(n.2)(5)33 55 Fib(n)()+()3355 5 ()+()333 Chap.2 Mathematical Preliminaries 85 =()3355 5.20nand 2n are never more efficient than the other choices.Both log3 nand log2 nwill have value 0 when n=1.Otherwise,2 is the most efficient expression for all n1.2/32 3n 2 log3n log2 nn20n 4nn!.(a)n+6 inputs(an additive amount,independent of n).(b)8ninputs(a multiplicative factor).(c)64ninputs.100n.10n.About(actually,3 100n).n+6.(a)These questions are quite hard.If f(n)=2n =x,then f(2n)=22n =2 (2n)2=x.(b)The answer is 2(nlog2 3).Extending from part(a),we need some way to make the growth rate even higher.In particular,we seek some way to log2 3=make the exponent go up by a factor of 3.Note that,if f(n)=n)=2log2 3log2 3=3x y,then f(2nn.So,we combine this observation with part(a)to get the desired answer.First,we need to find constants cand no such that 1 c 1 for nn0.This is true for any positive value cn0.This is true for,say,c=1 and n0=1.17 18 Chap.3 Algorithm Analysis Other values for n0 and care possible than what is given here.(a)The upper bound is O(n)for n0 0 and c=c1.The lower bound is (n)for n0 0 and c=c1.(b)The upper bound is O(n3)for n0 c3 and c =c2+1.The lower bound is(n3)for n0 c3 and c=c2.(c)The upper bound is O(nlog n)for n0 c5 and c=c4+1.The lower bound is(nlog n)for n0 c5 and c=c4.(d)The upper bound is O(2n)for n0 c7100 and c =c6+lower bound is(2n)for n0 c7100 and c =c6.(100 is used for convenience to insure that 2n n6)(a)f(n)=(g(n)since log n2=2 log n.(b)f(n)is in