It looks like you're new here. Sign in or register to get started.
My System has intel Core(TM) i3 Processor ,2.4 GHZ and 64 bit Windows 7 OS,and 4 GB ram these are enough to install RapidMiner Studio
Please visit: https://docs.rapidminer.com/studio/installation/system-requirements.html for more information.
I have a system configuration of ubuntu X64 4 GB RAM, and i am doing project on finding anomalies in network traffic. my machine is connected to a database VM, whenever i try to fetch data from database like for 3 years machine gets hanged. what id the ideal system requirement do you suggest for large datasets.
hello @rxk610 - welcome to the community. As @Thomas_Ott says, a database hang could be for any number of reasons. If you're running RM with only 4GB of RAM (the bare minimum) and you're pulling in large tables from an external database, I'm not surprised that it's hanging. A wise method would be to limit the database query to a thousand or so rows first, get everything working, and then incrementally increase your query limit to see if that's the problem.
Hi Thanks for the reply
can you help me wih the db query to limit the no of rows.
hi @rxk610 - so that's not really something I can do quickly without knowing more about your database. If you're in MySQL, the query is usually something like this:
SELECT * FROM <table> LIMIT 1000;
I usually go to https://www.w3schools.com/sql/default.asp for my SQL query help.
Thanks Scott for the help, I believe that LIMIT query helped in filtering the data somehow and we have upgraded the server to 8 GB as well.
Things are pretty smooth now, but not fully.
I am working on the project of anomaly detection using sdata (network traffic). Cna you please send me the links for education tutorial to get more insight on the tool to detect anomalies.
hello @rxk610 - lots of resources with outlier detection. @Andrew's video is quite good (https://www.youtube.com/watch?v=W9MEXJB8HQ4) as this one that talks about fraud detection (https://www.youtube.com/watch?v=Or_o-ntpLSo).
I am working on anomaly detection using outlier detection where there are different columns namely time_start, time_end, Src_address, dest_address, src_port, dest_port etc. I was planning to find outlier detection based on source_ports which are hitting on our internal IP (dest_address) range so that I can distinguish which are the ambiguous ports which are coming inside uni network. But I am not able to get through this as I am unable to create a model for that.
I would be grateful if you can help me with this, as it is quite urgent.
hello @rxk610 - we're all here and happy to help as best as we can. If you could please share your process (see instructions "Read Before Posting on right), a sample data set, and other info that would help.
I would be happy if I could have provided you the sample data but unfortunately, I cannot download the sample data as there is no internet connection on VM as per security policy. But I can give my laptop on remote via remote desktop/team viewer if it sounds good.
hello @rxk610 - my apologies but I normally don't do one-on-one assistance. I'm the only full-time employee of RapidMiner that actively monitors the community and you can imagine how crazy that would be if I did one-on-one with everyone. Feel free to post here as much as you like.
I was applying operator detect outlier detection using Detect Outlier using distance with columns in data set includes start time, end time, source port, dest port, source address destination address, it is giving error Failed to load extension cache.
Actually, I want to outlier distance between destination IP with which port as anomaly.
Please give your expert opinion.
Please find the error attached
hello @rxk610 - hmm ok that does not really seem like a RapidMiner problem but rather that your computer is not connected to the internet (see similar error about NewsService warning). And I think you already know this because you said you cannot download the sample data etc..
Perhaps you can at least read the "Read Before Posting" instructions and share your XML?
But why the internet is required to run the outlier detection, this is running locally right? Request you to please explain.
The error message you posted is "only" the log output. You are assuming correct that extensions are loaded locally.
The warning in your screenshot seems not relevant to me - at least for the displayed process.
Actually, I do not see any issue in your process.
Does it fail (i.e. aborting with an error pop up)? or do you need a different kind of assistance?