SDAP-118 create a new ranking module #31
base: master
initial work
Can one of the admins verify this patch?
lewismc
left a comment
There are several comments.
Please make sure that the files are removed from core if they are being built into the ranking module.
This is looking good.
@@ -0,0 +1,215 @@
<?xml version="1.0" encoding="UTF-8"?>
Please ensure that the license header is formatted correctly.
Also ensure that the code formatting matches the other pom.xml files.
Some files, such as RankingTrainData.java, will be moved to the ranking module in the next step, because I am currently working on the expert-provided training data.
</resource>
</resources>

<plugins>
We do not need the appassembler-maven-plugin configuration
</plugin>

<plugin>
<groupId>org.apache.maven.plugins</groupId>
We do not need the maven-shade-plugin configuration
ranking/src/main/assembly/bin.xml
Outdated
@@ -0,0 +1,50 @@
<?xml version="1.0" encoding="UTF-8"?>
We do not need this file.
@@ -0,0 +1,57 @@
/*
Make sure that the code is formatted with 2-space indents.
public static void main(String[] args) {
  SparkFormatter sf = new SparkFormatter();
  sf.toSparkSVMformat("C:/mudrodCoreTestData/rankingResults/inputDataForSVM.csv", "C:/mudrodCoreTestData/rankingResults/inputDataForSVM_spark.txt");
This cannot be hardcoded.
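One way to address this, sketched under assumptions rather than taken from the PR: read both paths from the command line instead of embedding them in the source. The `PathArgs` class and its `parse` method are hypothetical names introduced for illustration. Only `SparkFormatter` and `toSparkSVMformat` come from the diff, and they appear only in a comment so that the sketch compiles on its own.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

/** Hypothetical helper (not part of the PR): resolves input/output paths from program arguments. */
public class PathArgs {
  final Path input;
  final Path output;

  private PathArgs(Path input, Path output) {
    this.input = input;
    this.output = output;
  }

  /** Expects exactly two arguments: the input CSV path, then the output file path. */
  static PathArgs parse(String[] args) {
    if (args == null || args.length != 2) {
      throw new IllegalArgumentException("Usage: <inputCsv> <outputFile>");
    }
    return new PathArgs(Paths.get(args[0]), Paths.get(args[1]));
  }

  public static void main(String[] args) {
    PathArgs paths = parse(args);
    // In the PR, this is where the formatter would be invoked, for example:
    // new SparkFormatter().toSparkSVMformat(paths.input.toString(), paths.output.toString());
    System.out.println("input=" + paths.input + " output=" + paths.output);
  }
}
```

The same idea would work with a properties file or a configuration object; the point is only that the caller, not the source code, decides where the data lives.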
 * into a user specified directory.
 */
public void process() {
public void convert2TrainSet() {
Change convert2TrainSet to convertToTrainSet
for (int i = 1; i < arr.length - row; i++) {
  List<String> colList = new ArrayList<String>(); // create vector to store all values inside of a column, which is stored inside 2D vector
  for (int col = 0; col < arr[0].length - 1; col++) // Columns go until the next to last column
  for (int col = 1; col < arr[0].length - 2; col++) // Columns go until the next to last column
Why has this changed?
Previously, Yongyao manually deleted the first column of the expert-provided data before invoking this function to process it.
Wow, OK, thank you for fixing.
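To make the effect of the changed bounds concrete, here is a minimal sketch that lists which column indices each version of the inner loop visits. The `ColumnBounds` class, its `visited` method, and the five-column width are illustrative assumptions; only the two loop headers are taken from the diff.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only: mirrors the two inner-loop headers from the diff. */
public class ColumnBounds {
  /** Returns the indices visited by: for (col = start; col < width - trim; col++). */
  static List<Integer> visited(int width, int start, int trim) {
    List<Integer> cols = new ArrayList<>();
    for (int col = start; col < width - trim; col++) {
      cols.add(col);
    }
    return cols;
  }

  public static void main(String[] args) {
    int width = 5; // hypothetical row with columns 0..4
    System.out.println("old: " + visited(width, 0, 1)); // prints "old: [0, 1, 2, 3]"
    System.out.println("new: " + visited(width, 1, 2)); // prints "new: [1, 2]"
  }
}
```

With these bounds the new loop skips the first column and also stops before the last two columns, so the inline comment "Columns go until the next to last column" may no longer match the code and could be worth updating.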
public RankTrainDataFactory(Properties props, ESDriver es, SparkDriver spark) {
  super(props, es, spark);
  // TODO Auto-generated constructor stub
Remove TODO
}

// start session
// mode: overwrite or append
Remove comments
Also @quintinali please make sure that you test this code with monthly input logs.
@quintinali can you update the PR with the changes? Thanks