Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Repository location (a question from a beginner)
Hello
I am a new to RapidMiner.
I am trying to do text analysis of texts which are contained in a csv file. I installed RapidMiner on Windows PC, and tried data import wizard. In Step 5 of wizard, RapidMiner ask me to input the location of a repository, saying "Please specify a repository location". In the window, local repository directly is displayed with data and processes folder. I am wondering what I should do in this process. If I could find the "Local Repository" folder on my PC, I will try to copy the csv file into data or processes folder, but I could not find it by searching my PC.
Any advice will be welcomed.
Thanks
Y12
I am a new to RapidMiner.
I am trying to do text analysis of texts which are contained in a csv file. I installed RapidMiner on Windows PC, and tried data import wizard. In Step 5 of wizard, RapidMiner ask me to input the location of a repository, saying "Please specify a repository location". In the window, local repository directly is displayed with data and processes folder. I am wondering what I should do in this process. If I could find the "Local Repository" folder on my PC, I will try to copy the csv file into data or processes folder, but I could not find it by searching my PC.
Any advice will be welcomed.
Thanks
Y12
0
Answers
To load/store data from RapidMiner processes, you most of the time use a "repository". All repositories are treated the same in RapidMiner Studio, regardless of where the files actually are. The files you store in a repository can be stored on a file system (your default "Local Repository"), on a RM Server, in the Cloud or in a database. The repository mechanism was introduced so one has a convenient way of storing/retrieving files without having to worry about specific source/target location details.
In step 5 of the CSV import wizard you simply specify where you want to store the former .csv file in your repository for later access. When doing so, it is converted to a common format for data in Studio, a so called Example Set. Which is basically a table similar to a database table. After you have saved the data in the repository, you can then access it via the "Retrieve" operator in your Studio processes and do whatever you like with it.
Regards,
Marco