A flexible Python library for computing 10-nearest-neighbor (10NN) similarity scores from company business descriptions.
- Text Processing: TF-IDF vectorization of business descriptions
- Similarity Calculation: Cosine similarity between the TF-IDF vectors of each pair of companies
- 10NN Identification: Find the 10 closest competitors (most similar firms) for each firm
- Flexible Output: Multiple output formats for different analyses
- Extensible: Easy to integrate with various downstream applications
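
The pipeline described above (TF-IDF vectorization, cosine similarity, nearest-neighbor selection) can be illustrated with a minimal, standard-library-only sketch. This is not the library's actual API: the function names `tfidf_vectors`, `cosine`, and `nearest_neighbors`, the whitespace tokenizer, the smoothed IDF formula, and the sample company descriptions are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per document."""
    # Naive tokenizer (assumption): lowercase and split on whitespace.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Smoothed IDF (assumption) keeps weights positive for common terms.
        vectors.append(
            {t: c * (math.log((1 + n_docs) / (1 + df[t])) + 1) for t, c in tf.items()}
        )
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def nearest_neighbors(descriptions, k=10):
    """Map each company to its k most similar peers as (name, score) pairs."""
    names = list(descriptions)
    vectors = tfidf_vectors([descriptions[name] for name in names])
    result = {}
    for i, name in enumerate(names):
        # Score every other company, then keep the k highest scores.
        scores = [
            (other, cosine(vectors[i], vectors[j]))
            for j, other in enumerate(names)
            if j != i
        ]
        scores.sort(key=lambda pair: pair[1], reverse=True)
        result[name] = scores[:k]
    return result


# Hypothetical sample data; k=2 because the toy corpus has only four firms.
descriptions = {
    "AlphaBank": "retail banking loans and mortgage services",
    "BetaBank": "commercial banking loans and credit services",
    "GammaSoft": "cloud software for enterprise analytics",
    "DeltaSoft": "enterprise software and cloud analytics platform",
}
neighbors = nearest_neighbors(descriptions, k=2)
for company, peers in neighbors.items():
    print(company, [(peer, round(score, 3)) for peer, score in peers])
```

With the default `k=10`, each firm is mapped to its ten highest-scoring peers, or to all other firms if fewer than eleven are present. For corpora of realistic size, a vectorized implementation such as scikit-learn's `TfidfVectorizer` combined with `cosine_similarity` would likely be a better fit than this pure-Python loop.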