Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Merging prices from various product lists
helpmemine
Member Posts: 3 Contributor I
Hello everyone!
Being a supply chain manager, I have a lot of market data in my hands and I am under a lot of pressure buying the products at the right time and from the right wholesaler. My business field is related to consumer goods (toiletries, detergents, cosmetics, etc).
Over the last years our network of suppliers has grown and it is time consuming to go through all the product offers to find the best price or a undiscovered bargain, as we carry thousands of articles in the assortment. Fortunately most of the suppliers have a unique identifier in their offers (either EAN or UPC code) which could be used for data mining purposes.
Here is the short description of data from one of the suppliers:
4250925360284 (EAN Code)
Adidas deodorant (Product name)
150 ML (Size)
4,75 (Price)
1200 pcs (Available quantity)
My idea is to consolidate all the offers from wholesalers and form a sound repository/database of products that are offered on the market with an overview of current price minimums, maximums and averages. Essential price information in hand gives me a great competitive edge while executing the purchase orders and surely it will raise the revenues too.
Although I went through several tutorials and videos at the "Resources" section, I have hard time to figure out which Operators to use or how to build the Workflow to get around this problem.
I am a quick learner, so any help is much appreciated which can guide me to the right path.
Thank you!
Being a supply chain manager, I have a lot of market data in my hands and I am under a lot of pressure buying the products at the right time and from the right wholesaler. My business field is related to consumer goods (toiletries, detergents, cosmetics, etc).
Over the last years our network of suppliers has grown and it is time consuming to go through all the product offers to find the best price or a undiscovered bargain, as we carry thousands of articles in the assortment. Fortunately most of the suppliers have a unique identifier in their offers (either EAN or UPC code) which could be used for data mining purposes.
Here is the short description of data from one of the suppliers:
4250925360284 (EAN Code)
Adidas deodorant (Product name)
150 ML (Size)
4,75 (Price)
1200 pcs (Available quantity)
My idea is to consolidate all the offers from wholesalers and form a sound repository/database of products that are offered on the market with an overview of current price minimums, maximums and averages. Essential price information in hand gives me a great competitive edge while executing the purchase orders and surely it will raise the revenues too.
Although I went through several tutorials and videos at the "Resources" section, I have hard time to figure out which Operators to use or how to build the Workflow to get around this problem.
I am a quick learner, so any help is much appreciated which can guide me to the right path.
Thank you!
0
Answers
what you want is to join your data sources. Either inside a DB or in RM with the Join operator. Afterwards you want to use a Aggregate and Group By your EAN Codes. With that you can create the min/max/std_dev of your prices.
If you have an excel file with 10-20 examples i might find some time for an example process.
Dortmund, Germany
Thank you for a such a prompt reply!
The Join Operator lead was highly useful! I am currently going through the Join Operator tutorial and playing with the example data to understand how the process is built up and how the key attributes and operators work together. Good stuff!
I will try to move towards Aggregate and Group By later today.
Also, I compiled some example data, as requested:
https://dl.dropboxusercontent.com/u/90694307/Example%20data.xls
attached is a real small process calculating stuff grouped by your categories. If you want to use Generate Attribtues afterwards you need to replace the brackets.
You can simply copy this XML code into the XML view of your Rapidminer Studio. Be sure to change the path in Read Excel.
Best,
Martin
Dortmund, Germany
Thank you, this really helped a lot! After hours of integrating, I finally managed to get it working.
I already stumbled upon available quantity related questions, but I found a lead from the tutorial, again.
All the best!
great that it helped you. I don't know where you are working, but you might be interested in our trainings. We currently offer trainings in the US, UK and Germany. For details see: https://rapidminer.com/learning/training/ The Basic courses might be really suited for you to get into Predictive Analytics/Data Mining/ETL in Rapidminer etc.
If you are forced to learn it yourself, you might have a look at "Data Mining for the masses" a free book avaible here: https://rapidminer.com/wp-content/uploads/2013/10/DataMiningForTheMasses.pdf
It is really helpful for beginners.
If you have any further question, feel free to ask!
Best,
Martin
Dortmund, Germany