Web Scraper Service

This repository scrapes "https://books.toscrape.com/". The scraper recursively traverses every page of the site and saves all files (pages, images, and so on) to disk, preserving the site's file structure.
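The traversal described above can be sketched as a breadth-first crawl that stays on the starting host and visits each URL once. This is a minimal illustration, not the repository's actual code: the class name `CrawlSketch`, the method `crawl`, and the `linksOf` callback (assumed to return the raw link values found on a page, for example as extracted with Jsoup) are all hypothetical.

```java
import java.net.URI;
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.function.Function;

// Hypothetical sketch; the names here are not taken from the repository.
final class CrawlSketch {

    /**
     * Returns every same-host URL reachable from start, in visiting order.
     * linksOf is an assumed callback that yields the href/src values of a page.
     */
    static Set<String> crawl(String start, Function<String, List<String>> linksOf) {
        String host = URI.create(start).getHost();
        Set<String> visited = new LinkedHashSet<>();
        Deque<String> queue = new ArrayDeque<>();
        queue.add(start);

        while (!queue.isEmpty()) {
            String current = queue.poll();
            if (!visited.add(current)) {
                continue; // already processed, avoids loops between pages
            }
            for (String href : linksOf.apply(current)) {
                // Drop the fragment so "page.html#top" and "page.html" are one URL.
                int hash = href.indexOf('#');
                if (hash >= 0) {
                    href = href.substring(0, hash);
                }
                if (href.isEmpty()) {
                    continue;
                }
                URI resolved;
                try {
                    // Resolve relative links against the page they were found on.
                    resolved = URI.create(current).resolve(href).normalize();
                } catch (IllegalArgumentException e) {
                    continue; // skip links that are not valid URIs
                }
                // Stay on the starting host; external links are ignored.
                if (host != null && host.equalsIgnoreCase(resolved.getHost())) {
                    queue.add(resolved.toString());
                }
            }
        }
        return visited;
    }
}
```

Keeping link extraction behind a callback lets the traversal logic be exercised without network access; a real run would download each visited URL and save it to disk.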

The scraper is currently implemented with the Jsoup library.

Running the service

To run the service, execute the following command in a command prompt: ./gradlew run
All downloaded files are stored under the Books directory.
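
One way to keep the site's file structure under the Books directory is to reuse each URL's path as a relative file path. The sketch below is an assumption about how such a mapping could look, not the service's actual code: `LocalPathSketch`, `toLocalPath`, and the choice of `index.html` as the file name for directory-style URLs are hypothetical.

```java
import java.net.URI;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical sketch; the names here are not taken from the repository.
final class LocalPathSketch {

    /** Maps a URL to a file path below outputDir, mirroring the URL path. */
    static Path toLocalPath(Path outputDir, String url) {
        String path = URI.create(url).getPath();
        if (path == null) {
            path = "";
        }
        // Assumption: directory-style URLs are saved as index.html.
        if (path.isEmpty() || path.endsWith("/")) {
            path += "index.html";
        }
        String relative = path.startsWith("/") ? path.substring(1) : path;
        Path target = outputDir.resolve(relative).normalize();
        // Refuse paths that would escape the output directory (e.g. via "..").
        if (!target.startsWith(outputDir.normalize())) {
            throw new IllegalArgumentException("URL escapes output directory: " + url);
        }
        return target;
    }

    public static void main(String[] args) {
        Path books = Paths.get("Books");
        System.out.println(toLocalPath(books, "https://books.toscrape.com/"));
        System.out.println(toLocalPath(books, "https://books.toscrape.com/media/cache/a.jpg"));
    }
}
```

Before writing a file, the caller would typically create the parent directories, for example with `Files.createDirectories(target.getParent())`.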

