How to cite RapidMiner Studio for Scholarly Publications

sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor Posts: 2,130  Community Manager
edited January 4 in Knowledge Base
I get this question fairly often so I am opening this up for discussion. One option could look something like this:

Mierswa, Ingo, and Ralf Klinkenberg. “RapidMiner Studio.” RapidMiner Account, 9.1.000 (rev: ef0090, platform OSX), RapidMiner, Inc., 12 Dec. 2018, rapidminer.com, https://my.rapidminer.com/nexus/account/index.html. Educational License edition run on MacBookPro13,3 (15-inch, 2016): OS 10.14.2 "Mojave", processor 2.7 GHz Intel Quad-Core i7, L2 cache 256KB per core, L3 cache 8MB, memory 16GB RAM, boot ROM version 251.0.0.0.0, SMC Version 2.38f7, graphics Radeon Pro 3455 2048 MB & Intel HD Graphics 530 1536 MB, Java(TM) SE Runtime Environment (build 1.8.0_144-b01) Java HotSpot(TM) 64-Bit Server VM (build 25.144-b01, mixed mode).
What do you think?
varunm1topaz_nsgenzer

Comments

  • varunm1varunm1 Member Posts: 207   Unicorn
    edited January 2
    @sgenzer Thanks for posting. I thought of asking this question as I just use name in my publications. In APA, software reference looks like this.

    Mierswa, I., & Klinkenberg, R. (2018). RapidMiner Studio (9.1) [Data science, machine learning, predictive analytics]. Retrieved from https://rapidminer.com/
    Regards,
    Varun
    sgenzer
  • mschmitzmschmitz Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 1,952  RM Data Scientist
    edited January 2
    I was used to cite the old YALE paper from back in the days - is this old fashioned?

    I mean this one:

    And i like github links :)

    - Head of Data Science Services at RapidMiner -
    Dortmund, Germany
    sgenzer
  • David_ADavid_A Moderator, Employee, RMResearcher, Member Posts: 139  RM Research
    Hi,

    I also normally use the YALE paper (like that in APA style):

    Mierswa, I., Wurst, M., Klinkenberg, R., Scholz, M., & Euler, T. (2006, August). Yale: Rapid prototyping for complex data mining tasks. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 935-940). ACM.
    Which could be a bit misleading, as there is no RapidMiner in the title.

    Other good (and often used) sources to cite are either Predictive Analytics and Data Mining: Concepts and Practice with RapidMiner from Vijay Kotu or 
    RapidMiner: Data Mining Use Cases and Business Analytics Applications from  Markus Hofmann and Ralf Klinkenberg.

    The github link is nice, but problematic, as probably it's not the version used for the analysis.

    For citing the software, I simply add a footnote like that: "available at www.rapidminer.com" 
    sgenzertopaz_n
  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor Posts: 2,130  Community Manager
    thanks. All makes sense and likely depends a lot on where / for whom you are publishing.

    Interesting that no one mentioned my inclusion of the tech specs of the machine on which RapidMiner is installed. IMHO for scholarly transparency and repeatability, shouldn't this also be included?
  • David_ADavid_A Moderator, Employee, RMResearcher, Member Posts: 139  RM Research
    In theory I agree with you, but in reality I rarely see technical specs in computer/data science papers. At least it would be good practice to name the exact version used.
    sgenzer
  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor Posts: 2,130  Community Manager
    @David_A do you have some recent papers that I can look at as examples?
  • David_ADavid_A Moderator, Employee, RMResearcher, Member Posts: 139  RM Research
  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor Posts: 2,130  Community Manager
    awesome! Thanks @David_A!
  • varunm1varunm1 Member Posts: 207   Unicorn
    edited February 4
    Hello, @mschmitz and @David_A Just want to confirm if I can cite software APA style for Automodel features that I used in my research or is there any relevant publication for feature selection in Auto model. I generally cite the below when I write about software used in other paper but just want to confirm as this is a feature selection technique. @sgenzer any suggestion for this?

    Mierswa, I., & Klinkenberg, R. (2019). RapidMiner Studio (9.1) [Data science, machine learning, predictive analytics]. Retrieved from https://rapidminer.com/

    Thanks,
    Varun
    Regards,
    Varun
  • IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,536  RM Founder
    edited February 4
    For general citations of RapidMiner, the way how you did this is definitely acceptable.  Alternatively, you could also use the "official" YALE paper (YALE was the old name of RapidMiner): https://dl.acm.org/citation.cfm?id=1150531
    For unsupervised feature selection, the best paper to cite would be this GECCO paper here: https://dl.acm.org/citation.cfm?id=1144248
    And for general multi-objective optimization in feature engineering, my PhD thesis would be the best citation: http://www-ai.cs.uni-dortmund.de/auto?self=$Publication_fz5hgy8b
    Hope this helps and thanks to all for giving this some thoughts,
    Ingo
    varunm1topaz_nsgenzer
  • varunm1varunm1 Member Posts: 207   Unicorn
    Thanks @IngoRM
    Regards,
    Varun
  • IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,536  RM Founder
    Not for that.  It is also a bit of a shameless plug for my own research work ;-)
    topaz_nsgenzer
Sign In or Register to comment.