Genetic_Programming_Theory_and_Practice_XIII

(C. Jardin) #1

134 S. Gustafson et al.


for two reasons. Firstly, the code base of the GBR method consisted of open source
code that could be easily implemented in a prototype environment by application
engineers. The GP system contained too much unsupported code and many scripts
that it required for data management, ensemble learning, diversity measurement,
and visualization. Secondly, the training process was far simpler (both in number of
lines and amount of time) for the GBR system. The GP system required more time to
retrain and many intermediate files had to be managed. This chapter also introduced
Data Science and emphasized the value of having a Data Management solution
tightly coupled with GP. We leveraged an Ontology-Based Data Access approach
using Semantic Web technologies. We hope that these contributions inspire future
work that enable GP to become an effective Data Science tool.


References


Arnaldo I, Veeramachaneni K, O’Reilly UM (2014) Flash: A GP-GPU ensemble learning system
for handling large datasets. In: Nicolau M, et al. (eds.) 17th European conference on genetic
programming. LNCS, vol. 8599. Springer, Granada, pp 13–24
Castillo F, Kordon A, Sweeney J, Zirk W (2004) Using genetic programming in industrial statistical
model building. In: O’Reilly UM, Yu T, Riolo RL, Worzel B (eds.) Genetic programming theory
and practice II, Chap. 3. Springer, Ann Arbor, pp 31–48
De Rainville FM, Fortin FA, Gardner MA, Parizeau M, Gagne C (2012) DEAP: a python
framework for evolutionary algorithms. In: Wagner S, Affenzeller M (eds.) GECCO 2012
evolutionary computation software systems (EvoSoft). ACM, Philadelphia, PA, pp 85–92
Dhar V (2013) Data science and prediction. Commun ACM 56(12):64–73
Dubcakova R (2011) Eureqa: software review. Genet. Program. Evolvable Mach. 12(2):173–178
Fazenda P, McDermott J, O’Reilly UM (2012) A library to run evolutionary algorithms in the
cloud using MapReduce. In: Di Chio C, et al. (eds.) Applications of evolutionary computing,
EvoApplications 2012, LNCS, vol. 7248. Springer, Malaga, pp 416–425
Friedman J (2001) Greedy function approximation: a gradient boosting machine. Ann Stat
29(5):1189–1232
Gustafson S, Sheth A (2014) Web of things. Computing Now 7(3).http://www.computer.org/web/
computingnow/archive/march2014
Icke I, Bongard J (2013) Improving genetic programming based symbolic regression using
deterministic machine learning. In: de la Fraga LG (ed.) 2013 IEEE conference on evolutionary
computation, Cancun, vol. 1, pp 1763–1770
Jones E, Oliphant E, Peterson P, et al. (2001) Scipy: open source scientific tools for python.http://
wwwscipyorg
Kordon AK, Smits GF (2001) Soft sensor development using genetic programming. In: Spector
L, et al. (eds.) Proceedings of the genetic and evolutionary computation conference (GECCO-
2001). Morgan Kaufmann, San Francisco, CA, pp 1346–1351
Koza JR (1992) The genetic programming paradigm: Genetically breeding populations of com-
puter programs to solve problems. In: Soucek B, the IRIS Group (eds.) Dynamic, genetic, and
chaotic programming. Wiley, New York, pp 203–321
Loukides M (2010) What is Data science? OReilly Radar Report.http://cdn.oreilly.com/radar/
2010/06/What_is_Data_Science.pdf
O’Neill M, Vanneschi L, Gustafson S, Banzhaf W (2010) Open issues in genetic programming.
Genet Program Evolvable Mach 11(3/4):339–363 (tenth Anniversary Issue: Progress in Genetic
Programming and Evolvable Machines)

Free download pdf