OCR (Optical Character Recognition) technology is the primary method to extract text from images. Spire.OCR for Java provides developers with a quick and efficient solution to scan and extract text from images in Java projects. This article will guide you on how to use Spire.OCR for Java to recognize and extract text from images in Java projects.
Obtaining Spire.OCR for Java
To scan and recognize text in images using Spire.OCR for Java, you need to first import the Spire.OCR.jar file along with other relevant dependencies into your Java project.
You can download Spire.OCR for Java from our website. If you are using Maven, you can add the following code to your project's pom.xml file to import the JAR file into your application.
<repositories> <repository> <id>com.e-iceblue</id> <name>e-iceblue</name> <url>https://repo.e-iceblue.cn/repository/maven-public/</url> </repository> </repositories> <dependencies> <dependency> <groupId>e-iceblue</groupId> <artifactId>spire.ocr</artifactId> <version>1.9.0</version> </dependency> </dependencies>
Please download the other dependencies based on your operating system:
Install Dependencies
Step 1: Create a Java project in IntelliJ IDEA.
Step 2: Go to File > Project Structure > Modules > Dependencies in the menu and add Spire.OCR.jar as a project dependency.
Step 3: Download and extract the other dependency files. Copy all the files from the extracted "dependencies" folder to your project directory.
Scanning and Recognizing Text from a Local Image
- Java
import com.spire.ocr.OcrScanner; import java.io.*; public class ScanLocalImage { public static void main(String[] args) throws Exception { // Specify the path to the dependency files String dependencies = "dependencies/"; // Specify the path to the image file to be scanned String imageFile = "data/Sample.png"; // Specify the path to the output file String outputFile = "ScanLocalImage_out.txt"; // Create an OcrScanner object OcrScanner scanner = new OcrScanner(); // Set the dependency file path for the OcrScanner object scanner.setDependencies(dependencies); // Use the OcrScanner object to scan the specified image file scanner.scan(imageFile); // Get the scanned text content String scannedText = scanner.getText().toString(); // Create an output file object File output = new File(outputFile); // If the output file already exists, delete it if (output.exists()) { output.delete(); } // Create a BufferedWriter object to write content to the output file BufferedWriter writer = new BufferedWriter(new FileWriter(outputFile)); // Write the scanned text content to the output file writer.write(scannedText); // Close the BufferedWriter object to release resources writer.close(); } }
Specify the Language File to Scan and Recognize Text from an Image
- Java
import com.spire.ocr.OcrScanner; import java.io.*; public class ScanImageWithLanguageSelection { public static void main(String[] args) throws Exception { // Specify the path to the dependency files String dependencies = "dependencies/"; // Specify the path to the language file String languageFile = "data/japandata"; // Specify the path to the image file to be scanned String imageFile = "data/JapaneseSample.png"; // Specify the path to the output file String outputFile = "ScanImageWithLanguageSelection_out.txt"; // Create an OcrScanner object OcrScanner scanner = new OcrScanner(); // Set the dependency file path for the OcrScanner object scanner.setDependencies(dependencies); // Load the specified language file scanner.loadLanguageFile(languageFile); // Use the OcrScanner object to scan the specified image file scanner.scan(imageFile); // Get the scanned text content String scannedText = scanner.getText().toString(); // Create an output file object File output = new File(outputFile); // If the output file already exists, delete it if (output.exists()) { output.delete(); } // Create a BufferedWriter object to write content to the output file BufferedWriter writer = new BufferedWriter(new FileWriter(outputFile)); // Write the scanned text content to the output file writer.write(scannedText); // Close the BufferedWriter object to release resources writer.close(); } }
Apply for a Temporary License
If you'd like to remove the evaluation message from the generated documents, or to get rid of the function limitations, please request a 30-day trial license for yourself.