Categories: Homework on time

1.      Complete code in a compressed archive (zip, tgz, etc)2.      A readme file with complete des

1.      Complete code in a compressed archive (zip, tgz, etc)2.      A readme file with complete description of used software, installation, compilation and execution instructions3.      A document with the results for the questions below.Task:Develop a specialized Web crawler. Test your crawler only on the data in:Make sure that your crawler is not allowed to get out of this directory!!! Yes, there is a robots.txt file that must be used. Note that it is in a non-standard location.The required input to your program is N, the limit on the number of pages to retrieve and a list of stop words (of your choosing) to exclude. Perform case insensitive matching.You can assume that there are no errors in the input. Your code should be robust under errors in the Web pages you’re searching. If an error is encountered, feel free, if necessary, just to skip the page where it is encountered.Efficiency: Don’t be ridiculously inefficient. There’s no need to deliver turbo-charged algorithms or implementations. You don’t need to worry about memory constraints; if your program runs out of space and dies on encountering a large file, that’s OK. You do not have to use multiple threads; sequential downloading is OK.

Don't use plagiarized sources. Get Your Custom Essay on
1.      Complete code in a compressed archive (zip, tgz, etc)2.      A readme file with complete des
Just from $13/Page
Order Essay
superadmin

Share
Published by
superadmin

Recent Posts

Consider the following information, and answer the question below. China and England are internation

Consider the following information, and answer the question below. China and England are international trade…

4 years ago

The CPA is involved in many aspects of accounting and business. Let’s discuss some other tasks, othe

The CPA is involved in many aspects of accounting and business. Let's discuss some other…

4 years ago

For your initial post, share your earliest memory of a laser. Compare and contrast your first percep

For your initial post, share your earliest memory of a laser. Compare and contrast your…

4 years ago

2. The Ajax Co. just decided to save $1,500 a month for the next five years as a safety net for rece

2. The Ajax Co. just decided to save $1,500 a month for the next five…

4 years ago

How to make an insertion sort to sort an array of c strings using the following algorithm: * beg, *

How to make an insertion sort to sort an array of c strings using the…

4 years ago

Assume the following Keynesian income-expenditure two-sector model:

Assume the following Keynesian income-expenditure two-sector model:                                                AD = Cp + Ip                                                Cp = Co…

4 years ago