I want to design a crawler in Java that crawls a web page and extracts certain content from it. How should I do this? I am new to this and need guidance on how to start designing crawlers.
For example, I want to extract the text "red is my favorite color" from a web page where it is embedded like this:
<div>red is my favorite color</div>
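One possible starting point, as a minimal sketch rather than a definitive design: fetch the page with the JDK's built-in `java.net.http.HttpClient` (Java 11 or later), then pull out the text of simple `<div>` elements. The class name `SimpleCrawler` and the methods `fetch` and `extractDivTexts` are hypothetical names chosen for illustration. The regular expression is a deliberate simplification that only handles `<div>` elements with no nested tags; for real pages a proper HTML parser (jsoup is a commonly used one) would likely be more robust.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical class name; a sketch, not a production crawler.
class SimpleCrawler {

    // Assumption: the target text sits in a <div> that contains no nested tags.
    private static final Pattern DIV_TEXT =
            Pattern.compile("<div\\b[^>]*>([^<]*)</div>", Pattern.CASE_INSENSITIVE);

    // Downloads the page at the given URL and returns its body as a string.
    static String fetch(String url) throws IOException, InterruptedException {
        HttpClient client = HttpClient.newBuilder()
                .followRedirects(HttpClient.Redirect.NORMAL)
                .build();
        HttpRequest request = HttpRequest.newBuilder(URI.create(url)).GET().build();
        HttpResponse<String> response =
                client.send(request, HttpResponse.BodyHandlers.ofString());
        return response.body();
    }

    // Returns the trimmed, non-empty text of every simple <div> in the HTML.
    static List<String> extractDivTexts(String html) {
        List<String> texts = new ArrayList<>();
        Matcher matcher = DIV_TEXT.matcher(html);
        while (matcher.find()) {
            String text = matcher.group(1).trim();
            if (!text.isEmpty()) {
                texts.add(text);
            }
        }
        return texts;
    }

    public static void main(String[] args) throws Exception {
        // With a URL argument the page is fetched; otherwise a sample string is used.
        String html = args.length > 0
                ? fetch(args[0])
                : "<html><body><div>red is my favorite color</div></body></html>";
        for (String text : extractDivTexts(html)) {
            System.out.println(text);
        }
    }
}
```

Keeping the fetching step separate from the extraction step means the extraction logic can be tried on a plain string first, and the regular expression can later be swapped for a real parser without touching the download code.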