- Kubinyi H, Folkers G, Martin YC (eds) (2006)
3D QSAR in drug design: recent advances.
Springer, Berlin
- Verma J, Khedkar VM, Coutinho EC (2010)
3D-QSAR in drug design-a review. Curr Top
Med Chem 10:95–115
- Breiman L, Friedman J, Stone CJ, Olshen RA
(1984) Classification and regression trees.
CRC Press, Boca Raton, FL
- Breiman L (2001) Random forests. Mach
Learn 45:5–32
- Ferri F, Pudil P, Hatef M, Kittler J (1994)
Comparative study of techniques for large-
scale feature selection. Pattern Recognit Pract
IV 1994:403–413
- Raschka S (2017) rasbt/mlxtend: Version
0.7.0. https://doi.org/10.5281/zenodo.
816309
- Hansen GJA, Jones ML (2008) A rapid assess-
ment approach to prioritizing streams for con-
trol of Great Lakes sea lampreys (Petromyzon
marinus): a case study in adaptive management.
Can J Fish Aquat Sci 65:2471–2484
- Irwin JJ, Shoichet BK (2005) ZINC—a free
database of commercially available compounds
for virtual screening. J Chem Inf Model
45:177–182
- Allen F (2002) The Cambridge Structural
Database: a quarter of a million crystal struc-
tures and rising. Acta Crystallogr Sect B Struct
Sci 58:380–388
- Johnson NS, Yun S-S, Li W (2014) Investiga-
tions of novel unsaturated bile salts of male sea
lamprey as potential chemical cues. J Chem
Ecol 40:1152–1160
- Van Rossum G (2007) Python programming
language. In: USENIX annual technical con-
ference, p 36
- Van Der Walt S, Colbert SC, Varoquaux G
(2011) The NumPy array: a structure for effi-
cient numerical computation. Comput Sci Eng
13:22–30
- Jones E, Oliphant T, Peterson P (2001) SciPy:
open source scientific tools for Python.http://
http://www.scipy.org/
- McKinney W, et al. (2010) Data structures for
statistical computing in Python. In: Millman J,
vand der Walt S (eds) Proceedings of the 9th
Python Science conference, pp 51–56
- Hunter JD (2007) Matplotlib: a 2D graphics
environment. Comput Sci Eng 9:90–95
- Pedregosa F, Varoquaux G, Gramfort A,
Michel V, Thirion B, Grisel O, Blondel M,
Prettenhofer P, Weiss R, Dubourg V (2011)
Scikit-learn: machine learning in Python. J
Mach Learn Res 12:2825–2830
- Aiello A, Carbonelli S, Esposito G,
Fattorusso E, Iuvone T, Menna M (2000)
Novel bioactive sulfated alkene and alkanes
from the Mediterranean ascidian Halocynthia
papillosa. J Nat Prod 63:1590–1592
- Raschka S (2015) Python machine learning,
1st edn. Packt Publishing, Birmingham
- Louppe G (2014) Understanding random for-
ests: from theory to practice. Ph.D. thesis
- Walker SH, Duncan DB (1967) Estimation of
the probability of an event as a function of
several independent variables. Biometrika
54:167–179
- Hughes G (1968) On the mean accuracy of
statistical pattern recognizers. IEEE Trans Inf
Theory 14:55–63
- Raschka S, Mirjalili V (2017) Python machine
learning, 2nd edn. Packt Publishing,
Birmingham
- Raschka S, Julian D, Hearty J (2016) Python:
deeper insights into machine learning, 1st edn.
Packt Publishing, Birmingham
- Hastie T, Tibshirani R, Friedman J, Hastie T,
Tibshirani R (2001) Springer series in statistics.
Springer, New York, NY
- Mu ̈ller AC, Guido S (2017) Introduction to
machine learning with Python: a guide for data
scientists. O’Reilly Media, Sebastopol, CA
- Hawkins PCD, Skillman AG, Warren GL,
Ellingson BA, Stahl MT (2010) Conformer
generation with OMEGA: algorithm and vali-
dation using high quality structures from the
Protein Databank and Cambridge Structural
Database. J Chem Inf Model 50:572–584
- Hawkins PCD, Nicholls A (2012) Conformer
generation with OMEGA: learning from the
data set and the analysis of failures. J Chem
Inf Model 52:2919–2936
- Raschka S (2017) BioPandas: working with
molecular structures in pandas DataFrames. J
Open Source Softw. doi:10.21105/joss.00279
- Strobl C, Boulesteix A, Kneib T, Augustin T,
Zeileis A (2008) Conditional variable impor-
tance for random forests. BMC Bioinformatics
9:307
- Strobl C, Malley J, Tutz G (2009) An intro-
duction to recursive partitioning: rationale,
application, and characteristics of classification
and regression trees, bagging, and random for-
ests. Psychol Methods 14:323
Inferring Activity Discriminants 337