Web Scraper Java

This Java project demonstrates a simple web scraper using Jsoup. It connects to a specified URL, retrieves the webpage's title, and extracts and prints the content from a specific section of the page.

Project Structure

src/main/java/org/oxylabs/Main.java: The main class that performs web scraping.

Features

Connects to a webpage using Jsoup
Retrieves and prints the webpage title
Extracts and prints the content of the "About Us" section (or a similar section based on the provided CSS selector)

Prerequisites

Java Development Kit (JDK) 8 or higher
Maven for dependency management

Setup

Clone the repository:

git clone https://github.com/Hemanths05/web-scraper-java.git
cd web-scraper-java

Navigate to the project directory:
```
cd web-scraper-java
```

Add Jsoup dependency to your pom.xml:

<dependencies>
    <dependency>
        <groupId>org.jsoup</groupId>
        <artifactId>jsoup</artifactId>
        <version>1.15.3</version> <!-- Check for the latest version -->
    </dependency>
</dependencies>

Compile and run the project:

Compile the code:
```
mvn clean compile
```

Run the application:

mvn exec:java -Dexec.mainClass="org.oxylabs.Main"

How It Works

The Main class connects to the URL https://hemanths05.github.io/portfolio_/ using Jsoup.
It retrieves the title of the webpage and prints it.
It selects the content of the section with the CSS class .txtHead and prints it.
If the section is not found, it prints a message indicating that the section was not found.

Error Handling

The program handles exceptions and prints the stack trace if any errors occur during the connection or scraping process.

Contributing

Contributions are welcome! Feel free to open an issue or submit a pull request.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.idea		.idea
web-scraper-java		web-scraper-java
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Web Scraper Java

Project Structure

Features

Prerequisites

Setup

How It Works

Error Handling

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Hemanths05/web_scrapper_in_java

Folders and files

Latest commit

History

Repository files navigation

Web Scraper Java

Project Structure

Features

Prerequisites

Setup

How It Works

Error Handling

Contributing

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages