Youden's J statistic or Youden's index
is a single statistic that captures the performance of a dichotomous diagnostic test. Informedness is its generalization to the multiclass case and estimates the probability of an informed decision.
Cohen's kappa
A measure of how closely two judges agree on classification, taking into account the probability of judges agreeing via random chance.

Sensitivity
Ratio of True Positives divided by (True Positives + False Negatives) TP/(TP + FN)
Specificity
Ratio of True Negatives divided by (False Positives over True Negatives) TN/(TN + FP))
Positive Predictive Value
Ratio of True Positives divided by (True Positives plus False Positives) TP/(TP+FP)
Negative Predictive Value
Ratio of True Negatives divided by (True Negatives plus False Negatives) TN/(TN+FN)
Accuracy
Ratio of True Positives plus True Negatives divided by all positives and negatives (TP+TN)/(TP+TN+FP+FN)
Acronym AUC
Area under the Curve (For example see Receiver Operating Characteristics in Wikipedia.)
Acronym ROC
Receiver Operating Characteristics curve (see Wikipedia)
Cross Validation
sometimes called rotation estimation, is a model validation technique for assessing how the results of a statistical analysis will generalize to an independent data set.
Probe, Quiz, Test
Successive steps in assessing how well a machine learning algorithm performs on specific cases, improving the algorithm and then finally testing it against cases that WERE NOT USED TO TRAIN IT.
Random subsampling
A method for selecting test data versus training data
K-fold sampling
Break data into K equal size subsets.

Use 1 for testing and K-1 for training. Test each individual subset by training with it's K-1 counterparts.

Leave one out sampling
Literally leave one sample out of the training set and then test the prediction of that sample. Iterate by training and then testing on each sample, leaving out a different sample each iteration.
Bootstrap sampling
Random sampling with replacement. Underestimates error.

".632 bootstrap" is designed to try to compensate for that underestimation. See Wikipedia article on Cross Validation.

Acronym GLM
Generalized Linear Model
McNemar's Test
A statistical test used on paired nominal data. It is applied to 2 × 2 contingency tables with a dichotomous trait, with matched pairs of subjects, to determine whether the row and column marginal frequencies are equal (that is, whether there is "marginal homogeneity").

some valuable machine learning algorithms in R
Linear discriminant analysis, Regression, Naive Bayes, Support Vector Machines, Classification and regression trees, Random Forests, Boosting, etc.
Prevalence
The number of cases of a type that are present in a particular population at a given time
Incidence
The number of new cases that develop in a given period of time (disease, for example).
informedness
is the generalization of Youden's J statistic to the multiclass case and estimates the probability of an informed decision.
bias
?
confidence interval
?
Acronym LOOCV
Leave one out cross validation
variance
?
repeatedcv
repeated cross validation
grid search
?
model tuning
improving a model to better predict
Learning Vector Quantization (LVQ)
is a prototype-based supervised classification algorithm. LVQ is the supervised counterpart of vector quantization systems. It is a special case of an artificial neural network, more precisely, it applies a winner-take-all Hebbian learning-based approach.

It is a precursor to self-organizing maps (SOM) and related to neural gas, and to the k-Nearest Neighbor algorithm (k-NN). LVQ was invented by Teuvo Kohonen

Vector Quantization
Hebbian learning-based approach
Self organizing maps (SOM)
k-Nearest Neighbor algorithm (k-NN)
Inductive bias or learning bias
bias of a learning algorithm is the set of assumptions that the learner uses to predict outputs given inputs that it has not encountered
Supervised learning
inferring a function from labeled training data. The training data consist of a set of training examples. Each example is a pair consisting of an input object (typically a vector) and a desired output value (also called the supervisory signal).