Skip to content

Releases: developmentAC/BeagleTA

1.5.11

27 Nov 00:11

Choose a tag to compare

There are three parts to this release;

  • 1_beagleTA_parser
    • This binary is the main parser program. It processes a corpus of text files and extracts relevant data based on a set of keywords.
  • 2_csv_cleaner_filter
    • This component is used to clean and filter the output CSV file generated by the parser. It allows users to remove noise such as unnecessary columns and rows based on specific criteria.
  • 3_csv_random_sampler
    • The Sampler tool is used to randomly sample a specified number of rows from a cleaned CSV file and generate one or more smaller CSV files. This ia also particularly useful for working with large datasets that are difficult to process or visualize in their entirety. There are some types of analysis in which the trends are more interesting than the actual documents containing keywords. If the user wants to have a basic snapshot of the kinds of connections in the data, then the Sampler tool could be used to pick a random selection of data to plot these types of results.