Correlation Matrix
Hi,
I have a large data set with many attributes.
I would like to see how closely the attributes are correlated, but because of the sheer number of them I'm only interested in attributes that are correlated at about 40% or more.
Is there a way to do this, for example using a filter of some description? I know you can remove correlated attributes and select by weights, but those are not what I need, as I'm interested in the high correlations.
Thank you for your time
Answers
There are options like "top k" and "top p%" in the Select by Weights operator that might help.
regards
Andrew
Thanks for the quick reply. I ran it this morning, but I don't think this is what I'm looking for.
What I need is the pairwise table, so I can specifically say there is a 50% correlation between Attribute A and B but a negative correlation between A and C.
Do you know if you can filter the actual matrix?
Thanks
Is there perhaps a method to export the pairwise table into a CSV file or generate a report based off of it?
Has anyone tried it before?
If it was in a database, it would be a simple case of selecting the rows where the correlation is above a certain amount.
Thanks
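To illustrate the database idea above, here is a minimal sketch in Python using the standard-library sqlite3 module. It assumes the pairwise table has already been exported as (attribute, attribute, correlation) rows. The table name, column names, and values are hypothetical and only there for illustration.

```python
import sqlite3

# Hypothetical pairwise rows: (attribute_a, attribute_b, correlation).
# The values are made up purely for illustration.
pairs = [
    ("A", "B", 0.50),
    ("A", "C", -0.62),
    ("B", "C", 0.10),
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE pairwise (attr_a TEXT, attr_b TEXT, corr REAL)")
conn.executemany("INSERT INTO pairwise VALUES (?, ?, ?)", pairs)

# Keep strong correlations in either direction, strongest first.
threshold = 0.5
strong = conn.execute(
    "SELECT attr_a, attr_b, corr FROM pairwise "
    "WHERE ABS(corr) >= ? ORDER BY ABS(corr) DESC",
    (threshold,),
).fetchall()
conn.close()

print(strong)
```

Using ABS(corr) keeps strong negative correlations as well as positive ones. Drop the ABS if only positive correlations are of interest.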
A Groovy script would be able to do it. I could probably do that in return for beer or money ;D
Alternatively, I'm having a think about the possibility of calculating the correlation in a process without using the built-in operators. That would let you make an example set that could be filtered as you like.
regards
Andrew
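As a rough sketch of what calculating the correlation by hand could look like outside RapidMiner, the Python below computes the Pearson correlation for every attribute pair and yields one row per pair, which gives a table that can be filtered freely. The function names and toy data are hypothetical, and this is not the process referred to in this thread.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return float("nan")  # undefined for a constant column
    return cov / sqrt(var_x * var_y)


def pairwise_table(columns):
    """Yield (name_a, name_b, r) once per unordered attribute pair."""
    for (name_a, col_a), (name_b, col_b) in combinations(columns.items(), 2):
        yield name_a, name_b, pearson(col_a, col_b)


# Hypothetical toy data: B rises with A, C falls as A rises.
data = {
    "A": [1, 2, 3, 4],
    "B": [2, 4, 6, 8],
    "C": [4, 3, 2, 1],
}
table = list(pairwise_table(data))
for row in table:
    print(row)
```

Each pair appears once (A-B but not B-A), which halves the row count compared with a full matrix. The pure-Python loop is only practical for a modest number of attributes.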
Unfortunately, it doesn't provide a pairwise table, and the matrix in question covers 5000 attributes, so exporting it to Excel means cutting off a good portion of it.
I'll keep the beer money in mind of course, as soon as the next pay check comes around.
Have a look at process below:
This works.
However, I have one last problem in relation to this.
My pairwise table is going to generate roughly 25 million rows, which is not exportable using a report.
Is there any way to filter the matrix/pairwise table so that only attributes with a certain correlation are exported, for example only attributes with 50% or more correlation?
Thanks
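One possible way to avoid exporting all 25 million rows is to filter before writing anything out. The sketch below assumes the data can be loaded into Python with NumPy, which is a different route from the RapidMiner process in this thread. The function name, attribute names, and random toy data are hypothetical. It keeps only the upper triangle of the matrix, so each pair is listed once, and writes only pairs whose absolute correlation meets the threshold.

```python
import csv
import io

import numpy as np


def strong_pairs(X, names, threshold=0.5):
    """Return (name_a, name_b, r) for each pair with |r| >= threshold."""
    corr = np.corrcoef(X, rowvar=False)  # columns of X are attributes
    # k=1 drops the diagonal and the mirrored lower triangle.
    mask = np.triu(np.abs(corr) >= threshold, k=1)
    rows, cols = np.nonzero(mask)
    return [(names[i], names[j], float(corr[i, j])) for i, j in zip(rows, cols)]


# Hypothetical toy data: b tracks a, c mirrors a, d is unrelated noise.
rng = np.random.default_rng(0)
a = rng.normal(size=200)
X = np.column_stack([
    a,
    a + 0.1 * rng.normal(size=200),
    -a + 0.1 * rng.normal(size=200),
    rng.normal(size=200),
])
names = ["a", "b", "c", "d"]

result = strong_pairs(X, names, threshold=0.5)

# Write only the filtered pairs; swap StringIO for open("pairs.csv", "w", newline="").
buffer = io.StringIO()
writer = csv.writer(buffer)
writer.writerow(["attribute_a", "attribute_b", "correlation"])
writer.writerows(result)
print(buffer.getvalue())
```

For 5000 attributes the full matrix is 5000 x 5000 double-precision values, roughly 200 MB, so it should fit in memory on most machines. Listing each pair once leaves about 12.5 million candidate pairs before the threshold is applied.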