D. Koller and M. Sahami (1997). "Hierarchically classifying documents using very few words." Proceedings of the Fourteenth International Conference on Machine Learning (ICML) (pp. 170-178).
D. Koller and M. Sahami (1996). "Toward Optimal Feature Selection." Proceedings of the Thirteenth International Conference on Machine Learning (ICML) (pp. 284-292).