Shabdobhedi committed on
Commit
db0e34c
·
verified ·
1 Parent(s): 6d06d99

Update README.md

Files changed (1)
  1. README.md +94 -0
README.md CHANGED
@@ -12,3 +12,97 @@ short_description: Extract Hindi and English Text from image
12
  ---
13
 
14
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
15
+
16
+
17
+ # OCR and Document Search Web Application Prototype (Using gemini-1.5)
18
+
19
+ ## Introduction
20
+ This project is a web-based prototype designed to perform Optical Character Recognition (OCR) on images containing text in both Hindi and English. The application allows users to upload an image, extract the text, and search for specific keywords within the extracted content. The goal is to demonstrate the functionality of OCR technology in a user-friendly web application.
21
+
22
+
23
+
24
+ ## Technologies
25
+ The following technologies and libraries were used in this project:
26
+
27
+ - **Python**: Programming language used for development.
28
+ - **Google Generative AI** (`google-generativeai`): Client library for the Gemini 1.5 model, which performs the OCR on images.
29
+ - **Streamlit**: Framework for building the web application interface.
30
+ - **Pillow**: Library for image processing.
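
As a rough illustration of how these libraries might fit together, the sketch below sends an image and a prompt to a Gemini model and returns the reply text. It is not taken from the project's `ocr_utils.py`: the function names, the prompt wording, the `GOOGLE_API_KEY` environment variable, and the `sample.png` file name are all assumptions made for this example.

```python
import os

# Hypothetical prompt; the project's actual wording is not shown in this README.
OCR_PROMPT = (
    "Extract all Hindi and English text from this image. "
    "Return only the extracted text."
)


def build_model(model_name="gemini-1.5-pro"):
    """Create a Gemini client (needs google-generativeai and an API key)."""
    # Imported lazily so the helpers below can be used without the package.
    import google.generativeai as genai

    # Assumption: the key is provided through the GOOGLE_API_KEY variable.
    genai.configure(api_key=os.environ["GOOGLE_API_KEY"])
    return genai.GenerativeModel(model_name)


def extract_text(image, model):
    """Send the prompt plus image to the model and return the stripped reply."""
    response = model.generate_content([OCR_PROMPT, image])
    return response.text.strip()


# Example usage (requires Pillow, network access, and a valid key):
#   from PIL import Image
#   print(extract_text(Image.open("sample.png"), build_model()))
```

Passing the model in as an argument keeps `extract_text` easy to exercise with a stand-in object, so the OCR step can be checked without calling the real API.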
31
+
32
+ ## Requirements
33
+ To run this project, you will need the following libraries installed:
34
+
35
+ ```plaintext
36
+ streamlit
37
+ google-generativeai
38
+ Pillow
39
+ ```
40
+
41
+
42
+
43
+
44
+
45
+ ## Project Structure
46
+
47
+
48
+
49
+ ```
50
+
51
+ /OCR-Document-Search-App-Using_gemini-1.5-pro
52
+
53
+ ├── app.py # Main application file
54
+ ├── ocr_utils.py # Utility functions for OCR processing
55
+ └── requirements.txt # List of required libraries
56
+
57
+ ```
58
+ # How to Run the Application
59
+
60
+ ## 1. Clone the Repository:
61
+
62
+
63
+ ```bash
64
+ git clone https://github.com/pratim808/OCR-Document-Search-App-Using_gemini-1.5-pro.git
65
+ ```
66
+
67
+ ## 2. Navigate to the Project Directory:
68
+
69
+
70
+ ```bash
71
+ cd OCR-Document-Search-App-Using_gemini-1.5-pro
72
+ ```
73
+
74
+ ## 3. Install Dependencies:
75
+
76
+ ```bash
77
+ pip install -r requirements.txt
78
+ ```
79
+ ## 4. Run the Application:
80
+
81
+ ```bash
82
+ streamlit run app.py
83
+ ```
84
+ ## 5. Access the Application:
85
+
86
+ Open your web browser and go to http://localhost:8501.
87
+
88
+
89
+
90
+
91
+ ## Features
92
+
93
+
94
+ - **Image Upload**: Users can upload images in JPG, JPEG, or PNG format.
95
+ - **Text Extraction**: The application uses Gemini 1.5, through the Google Generative AI library, to extract Hindi and English text from the uploaded image.
96
+ - **Keyword Search**: Users can enter keywords to search within the extracted text, with matching sections highlighted.
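
The keyword search with highlighting could be implemented along the following lines. This is a minimal sketch rather than the project's actual code: the function name `highlight_matches` and the choice of Markdown bold markers are assumptions for illustration.

```python
import re


def highlight_matches(text, keyword, open_tag="**", close_tag="**"):
    """Wrap each case-insensitive occurrence of keyword in marker strings.

    Returns a (highlighted_text, match_count) tuple.
    """
    keyword = keyword.strip()
    if not keyword:
        # Nothing to search for: return the text unchanged.
        return text, 0
    # re.escape makes characters such as "." match literally.
    pattern = re.compile(re.escape(keyword), re.IGNORECASE)
    return pattern.subn(
        lambda m: f"{open_tag}{m.group(0)}{close_tag}", text
    )


highlighted, count = highlight_matches("Hello world, hello Hindi", "hello")
print(count, highlighted)
```

Because the default markers are Markdown bold, the result could plausibly be passed to Streamlit's `st.markdown` for display. The same function works on Devanagari text, since the match is done on Unicode strings.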
97
+ ## Deployment
98
+ The application can be deployed on platforms such as Streamlit Sharing or Hugging Face Spaces. Follow each platform's instructions to make the application accessible via a public URL.
99
+ ## Live URL
100
+
101
+ https://huggingface.co/spaces/Shabdobhedi/OCR-Document-Search-App-Using_gemini-1
102
+ ## Conclusion
103
+ This project demonstrates how OCR technology can be integrated into a web application, providing an efficient tool for extracting and searching text from images.
104
+ ## Demo
105
+
106
+ https://drive.google.com/file/d/1RKYpN6zbqqWFhCHslFAPxJqDgLZbkma6/view?usp=sharing
107
+
108
+