I am using the K-NN algorithm for classification.
My training data has 3 dimensions plus a label, and looks like this:
Username (Polynomial), Count_of_connections (Integer), Destination_IP (Polynomial), Status (as label)
Alex, 100, 172.16.1.11,normal
Mat, 200, 172.16.5.1 , normal
Angelo, 50, 172.16.4.11, normal
Alexis, 8 , 172.16.1.10
Alexis, 8 , 172.16.1.10, normal
I want the algorithm not to compare the text in Username and Destination_IP, and to decide based on Count_of_connections only.
That is, the characters in the Username and Destination_IP values must not affect the algorithm, and each of these dimensions should be treated as a single opaque unit. I don't know how to solve this problem!
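
To make the goal concrete, here is a minimal sketch in plain Python of a K-NN that measures distance on Count_of_connections only, so Username and Destination_IP are carried along but never compared. The function name `knn_predict`, the choice of `k`, and the last training row (a made-up "suspicious" example, not part of my data) are assumptions for illustration only.

```python
from collections import Counter

# Rows: (username, count_of_connections, destination_ip, status).
training_rows = [
    ("Alex", 100, "172.16.1.11", "normal"),
    ("Mat", 200, "172.16.5.1", "normal"),
    ("Angelo", 50, "172.16.4.11", "normal"),
    ("Alexis", 8, "172.16.1.10", "normal"),
    # Hypothetical row, NOT from the real data, added only so a second class exists.
    ("Eve", 5000, "172.16.9.9", "suspicious"),
]

def knn_predict(rows, query_count, k=3):
    """Classify using only Count_of_connections; the text columns are never compared."""
    # Distance is the absolute difference between connection counts.
    by_distance = sorted(rows, key=lambda row: abs(row[1] - query_count))
    nearest_labels = [row[3] for row in by_distance[:k]]
    # Majority vote among the k nearest neighbours.
    return Counter(nearest_labels).most_common(1)[0][0]

print(knn_predict(training_rows, 8))
print(knn_predict(training_rows, 6000, k=1))
```

In a tool-based workflow the equivalent would presumably be to exclude Username and Destination_IP from the attributes passed to the K-NN learner (or mark them as identifiers), rather than changing the distance measure itself.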