Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

How to cite RapidMiner Studio for Scholarly Publications

sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager
edited June 2019 in Knowledge Base
I get this question fairly often so I am opening this up for discussion. One option could look something like this:

Mierswa, Ingo, and Ralf Klinkenberg. “RapidMiner Studio.” RapidMiner Account, 9.1.000 (rev: ef0090, platform OSX), RapidMiner, Inc., 12 Dec. 2018, rapidminer.com, https://my.rapidminer.com/nexus/account/index.html. Educational License edition run on MacBookPro13,3 (15-inch, 2016): OS 10.14.2 "Mojave", processor 2.7 GHz Intel Quad-Core i7, L2 cache 256KB per core, L3 cache 8MB, memory 16GB RAM, boot ROM version 251.0.0.0.0, SMC Version 2.38f7, graphics Radeon Pro 3455 2048 MB & Intel HD Graphics 530 1536 MB, Java(TM) SE Runtime Environment (build 1.8.0_144-b01) Java HotSpot(TM) 64-Bit Server VM (build 25.144-b01, mixed mode).
What do you think?

Comments

  • varunm1varunm1 Member Posts: 1,207 Unicorn
    edited January 2019
    @sgenzer Thanks for posting. I thought of asking this question as I just use name in my publications. In APA, software reference looks like this.

    Mierswa, I., & Klinkenberg, R. (2018). RapidMiner Studio (9.1) [Data science, machine learning, predictive analytics]. Retrieved from https://rapidminer.com/
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,529 RM Data Scientist
    edited January 2019
    I was used to cite the old YALE paper from back in the days - is this old fashioned?

    I mean this one:

    And i like github links :)

    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • David_ADavid_A Administrator, Moderator, Employee, RMResearcher, Member Posts: 297 RM Research
    Hi,

    I also normally use the YALE paper (like that in APA style):

    Mierswa, I., Wurst, M., Klinkenberg, R., Scholz, M., & Euler, T. (2006, August). Yale: Rapid prototyping for complex data mining tasks. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 935-940). ACM.
    Which could be a bit misleading, as there is no RapidMiner in the title.

    Other good (and often used) sources to cite are either Predictive Analytics and Data Mining: Concepts and Practice with RapidMiner from Vijay Kotu or 
    RapidMiner: Data Mining Use Cases and Business Analytics Applications from  Markus Hofmann and Ralf Klinkenberg.

    The github link is nice, but problematic, as probably it's not the version used for the analysis.

    For citing the software, I simply add a footnote like that: "available at www.rapidminer.com" 
  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager
    thanks. All makes sense and likely depends a lot on where / for whom you are publishing.

    Interesting that no one mentioned my inclusion of the tech specs of the machine on which RapidMiner is installed. IMHO for scholarly transparency and repeatability, shouldn't this also be included?
  • David_ADavid_A Administrator, Moderator, Employee, RMResearcher, Member Posts: 297 RM Research
    In theory I agree with you, but in reality I rarely see technical specs in computer/data science papers. At least it would be good practice to name the exact version used.
  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager
    @David_A do you have some recent papers that I can look at as examples?
  • David_ADavid_A Administrator, Moderator, Employee, RMResearcher, Member Posts: 297 RM Research
  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager
    awesome! Thanks @David_A!
  • varunm1varunm1 Member Posts: 1,207 Unicorn
    edited February 2019
    Hello, @mschmitz and @David_A Just want to confirm if I can cite software APA style for Automodel features that I used in my research or is there any relevant publication for feature selection in Auto model. I generally cite the below when I write about software used in other paper but just want to confirm as this is a feature selection technique. @sgenzer any suggestion for this?

    Mierswa, I., & Klinkenberg, R. (2019). RapidMiner Studio (9.1) [Data science, machine learning, predictive analytics]. Retrieved from https://rapidminer.com/

    Thanks,
    Varun
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • IngoRMIngoRM Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    edited February 2019
    For general citations of RapidMiner, the way how you did this is definitely acceptable.  Alternatively, you could also use the "official" YALE paper (YALE was the old name of RapidMiner): https://dl.acm.org/citation.cfm?id=1150531
    For unsupervised feature selection, the best paper to cite would be this GECCO paper here: https://dl.acm.org/citation.cfm?id=1144248
    And for general multi-objective optimization in feature engineering, my PhD thesis would be the best citation: http://www-ai.cs.uni-dortmund.de/auto?self=$Publication_fz5hgy8b
    Hope this helps and thanks to all for giving this some thoughts,
    Ingo
  • varunm1varunm1 Member Posts: 1,207 Unicorn
    Thanks @IngoRM
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • IngoRMIngoRM Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    Not for that.  It is also a bit of a shameless plug for my own research work ;-)
Sign In or Register to comment.