LLM Literary Analysis Tools

Overview

This repository, LLM Literary Analysis Tools, provides a comprehensive set of tools for analyzing emotional dimensions in Japanese literary works using multiple Large Language Models (LLMs). The project aims to automatically evaluate emotions such as amusement, surprise, sadness, and anger across different models, personas, and parameters, and to compare the results for academic research purposes. Our goal is to contribute to literary studies, education, and the development of emotion-based dialogue systems through automated sentiment analysis.

This work is intended for submission to an international academic journal, with a deadline for preparation by May 30, 2025.

Purpose and Background

The automation of sentiment analysis in literature holds significant potential for advancing literary research, educational tools, and emotion-driven dialogue systems. By leveraging multiple LLMs, this project seeks to uncover differences and consistencies in emotional evaluations, providing insights into model performance and optimal configurations for sentiment analysis.

Features

Emotion Analysis Across Multiple Dimensions: Evaluates Japanese literary works ("Kaichu Tokei", "Okane to Pistol", "Boroboro na Dachou") on four emotional dimensions (amusement, surprise, sadness, anger).
Comparative Studies: Analyzes the impact of different models, personas, and temperature settings on emotional evaluations.
Visualization Tools: Generates publication-ready graphs in PNG and SVG formats, with proper Japanese font support using japanize_matplotlib. Graph fonts, colors, and styles are centrally managed in src/config.py for enhanced readability and maintainability.
Language Switching Functionality: Utilizes messages defined in src/messages.json to switch between Japanese and English graph outputs via command-line options.
Modular Analysis Scripts: Includes series such as model_emotion, text_emotion, persona_emotion, temperature_emotion, and more, each focusing on specific aspects of the data.

Installation

To use these tools, you'll need to have Python 3.x installed along with the required libraries. Follow these steps to set up the environment:

Clone the Repository:

git clone https://github.com/nshrhm/llm-literary-analysis-tools.git
cd llm-literary-analysis-tools

Install Dependencies:

pip install -r requirements.txt

If a requirements.txt file is not provided, install the core libraries manually:

pip install pandas numpy scipy matplotlib seaborn japanize-matplotlib

Usage

To run the full suite of analysis scripts and generate results, execute one of the following scripts:

Generate Japanese graphs:
```
bash make_result_ja.sh
```
Generate English graphs:
```
bash make_result_en.sh
```

These scripts execute all analysis and visualization scripts. Individual scripts in the src/ directory can also be run separately for specific analyses.

For individual visualization scripts (model_emotion_visualize.py, text_emotion_visualize.py, persona_emotion_visualize.py, temperature_emotion_visualize.py, etc.), you can specify the language for the generated graphs using the --lang option:

python ./src/<script_name>.py [--lang {ja,en}]

--lang: Language for visualization (default: ja)
- ja: Japanese
- en: English

For detailed information on each script, refer to the Data Analysis Scripts Guide.

CSV Outputs: Statistical results and data extracts are saved in results/.
Figures: Graphs are saved in results/figures/ in both PNG (for Microsoft Word) and SVG (for LaTeX) formats.
- results/figures/ja/: Japanese graphs
- results/figures/en/: English graphs

Directory Structure

src/: Contains all analysis and visualization scripts.
results/: Stores CSV outputs of statistical analyses.
results/figures/: Stores generated graphs in PNG and SVG formats.
memory-bank/: Documentation and context files for project management and Cline instructions.

Contributing

We welcome contributions from researchers and developers interested in advancing sentiment analysis of literature using LLMs. To contribute:

Fork the repository.
Create a new branch for your feature or bug fix.
Submit a pull request with a detailed description of your changes.

Please report any issues or feature requests via the GitHub Issues page.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contact

For inquiries or further information, please use the GitHub Issues page to reach out to the project maintainers.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
docs		docs
memory-bank		memory-bank
results		results
src		src
tsv		tsv
.gitignore		.gitignore
README.md		README.md
README_ja.md		README_ja.md
data_sample.csv		data_sample.csv
make_result_en.sh		make_result_en.sh
make_result_ja.sh		make_result_ja.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

LLM Literary Analysis Tools

Overview

Purpose and Background

Features

Installation

Usage

Directory Structure

Contributing

License

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Languages

nshrhm/llm-literary-analysis-tools

Folders and files

Latest commit

History

Repository files navigation

LLM Literary Analysis Tools

Overview

Purpose and Background

Features

Installation

Usage

Directory Structure

Contributing

License

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages