Adding "PDF Organizer" Script #230

Open: wants to merge 6 commits into master

Retsamic

Summary

This pull request adds a new utility script, "PDF Organizer," that automates the organization of PDF files in a specified directory. Using each PDF's metadata, the script sorts files into subfolders, renames them to a uniform pattern, and generates a summary report of what it did.

Features

  • Metadata Extraction: Utilizes PyPDF2 to extract metadata (title, author) from PDFs.
  • Dynamic Organization: Automatically sorts PDFs into author-named subfolders.
  • Standardized Renaming: Renames PDF files to a consistent "Author - Title.pdf" format.
  • Summary Reporting: Generates a report detailing the organization outcome, including counts of PDFs per category.
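
The script itself is not shown in this conversation, so the following is only a minimal sketch of how the extraction, sorting, and renaming features above could fit together. The function names (`sanitize`, `target_path`, `read_metadata`, `organize`) and the fallback labels "Unknown Author" and "Untitled" are assumptions for illustration, not names taken from the PR. It also assumes PyPDF2 2.x or later, where `PdfReader` and its `metadata` property are available.

```python
import re
import shutil
from collections import Counter
from pathlib import Path


def sanitize(text, fallback):
    """Drop characters that are unsafe in file names; use fallback if nothing is left."""
    cleaned = re.sub(r'[\\/:*?"<>|\x00-\x1f]', "", text or "").strip(" .")
    return cleaned or fallback


def target_path(root, author, title):
    """Build root/Author/'Author - Title.pdf' from raw metadata values."""
    author = sanitize(author, "Unknown Author")  # assumed fallback label
    title = sanitize(title, "Untitled")          # assumed fallback label
    return Path(root) / author / f"{author} - {title}.pdf"


def read_metadata(pdf_path):
    """Return (author, title) read with PyPDF2; either value may be None."""
    # Imported lazily so the pure helpers above work without PyPDF2 installed.
    from PyPDF2 import PdfReader

    info = PdfReader(str(pdf_path)).metadata
    if info is None:
        return None, None
    return info.author, info.title


def organize(source_folder):
    """Move each PDF in source_folder into an author subfolder; return counts per author."""
    source = Path(source_folder)
    counts = Counter()
    for pdf in sorted(source.glob("*.pdf")):
        author, title = read_metadata(pdf)
        # Fall back to the original file name when the title is missing.
        dest = target_path(source, author, title or pdf.stem)
        if dest.exists():
            continue  # leave the file in place rather than overwrite an existing one
        dest.parent.mkdir(parents=True, exist_ok=True)
        shutil.move(str(pdf), str(dest))
        counts[dest.parent.name] += 1
    return counts
```

Sanitizing the metadata before building the path matters because author and title strings often contain characters such as `/` or `:` that are not valid in file names on every platform. Skipping existing destinations is one possible collision policy; appending a numeric suffix would be another.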

Motivation

Managing a large collection of PDF files can be tedious, especially when dealing with academic papers, legal documents, or extensive reports. This script aims to simplify file management, making it easier for users to locate and organize their PDFs by author and title.

How to Use

  1. Ensure PyPDF2 is installed: pip install PyPDF2
  2. Modify the source_folder and report_path in the script to match your directory structure.
  3. Run the script: python pdf_organizer.py
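
Step 2 names two settings, `source_folder` and `report_path`, without showing them. As a rough sketch only, they might be defined near the top of the script and used to write the summary report as below. The example paths, the report layout, and the helpers `format_report` and `write_report` are hypothetical and not taken from the PR.

```python
from collections import Counter
from pathlib import Path

# Hypothetical values; edit these to match your own directory structure.
source_folder = Path("~/Documents/pdfs").expanduser()
report_path = source_folder / "organizer_report.txt"


def format_report(counts):
    """Render per-folder counts as plain text, largest folders first."""
    lines = [f"PDFs organized: {sum(counts.values())}"]
    for author, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        lines.append(f"{author}: {n}")
    return "\n".join(lines) + "\n"


def write_report(counts, path):
    """Write the formatted report to the given path as UTF-8 text."""
    Path(path).write_text(format_report(counts), encoding="utf-8")
```

For example, `write_report(Counter({"Jane Doe": 2}), report_path)` would produce a two-line report: a total followed by one line per author folder.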

Future Enhancements

  • Support for additional metadata fields (e.g., subject, keywords) for finer categorization.
  • Interactive mode allowing users to select organization criteria directly.
  • GUI for ease of use by non-technical users.

I believe this script will be a valuable addition to the repository, offering a practical solution to a common problem faced by many. I look forward to feedback and any suggestions for improvement.

@DhanushNehru (Owner)

@Retsamic please update the main Readme.md as well

@hasan-py (Collaborator) left a comment

Could you please update the script list in the main README as well?
