East European Journal of Psycholinguistics               ISSN 2313-2116 
Home » Journal papers » Volume 5, Number 1, 2018 » Taraban, R., Koduru, L., LaCour, M. Marshall, P.

Finding a Common Ground in Human and Machine-Based Text Processing
[ Full text PDF (253.5 Kb) ] 2018-Jun-28, 7.05.49 AM

Abstract. Language makes human communication possible. Apart from everyday applications, language can provide insights into individuals’ thinking and reasoning. Machine-based analyses of text are becoming widespread in business applications, but their utility in learning contexts are a neglected area of research. Therefore, the goal of the present work is to explore machine-assisted approaches to aid in the analysis of students’ written compositions. A method for extracting common topics from written text is applied to 78 student papers on technology and ethics. The primary tool for analysis is the Latent Dirichlet Allocation algorithm. The results suggest that this machine-based topic extraction method is effective and supports a promising prospect for enhancing classroom learning and instruction. The method may also prove beneficial in other applied applications, like those in clinical and counseling practice.

Keywords: natural language processing, machine-analysis, latent Dirichlet allocation, text analysis, classroom learning, clinical and counseling practice.


Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet Allocation. Journal of Machine Learning Research 3, 993-1022.

Bruner, J. (1990). Acts of meaning. Cambridge, MA: Harvard University Press.

Chen, K. Y. M., & Wang, Y. (2007). Latent dirichlet allocation. http://acsweb.ucsd.edu/~yuw176/ report/lda.pdf.

Chung, C. K., & Pennebaker, J. W. (2008). Revealing dimensions of thinking in open-ended self-descriptions: An automated meaning extraction method for natural language. Journal of research in personality, 42(1), 96-132.

Feldman, S. (1999). NLP meets the Jabberwocky: Natural language processing in information retrieval. Online Magazine, 23, 62-73. Retrieved from: http://www.onlinemag.net/OL1999/ feldmann5.html

Mishlove, J. (2010). https://www.youtube.com/watch?v=0XTDLq34M18 (Accessed June 12, 2018).

Ostrowski, D. A. (2015). Using latent dirichlet allocation for topic modelling in twitter. In Semantic Computing (ICSC), 2015 IEEE International Conference (pp. 493-497). IEEE.

Pennebaker, J. W. (2004). Theories, therapies, and taxpayers: On the complexities of the expressive writing paradigm. Clinical Psychology: Science and Practice, 11(2), 138-142.

Pennebaker, J.W., Boyd, R.L., Jordan, K., & Blackburn, K. (2015). The development and psychometric properties of LIWC 2015. Austin, TX: University of Texas at Austin.

Pennebaker, J. W., Chung, C. K., Frazee, J., Lavergne, G. M., & Beaver, D. I. (2014). When small words foretell academic success: The case of college admissions essays. PLoS ONE, 9(12), e115844.

Pennebaker, J. W., & King, L. A. (1999). Linguistic styles: Language use as an individual difference. Journal of Personality and Social Psychology, 77(6), 1296-1312.

Recchia, G., Sahlgren, M., Kanerva, P., & Jones, M. N. (2015). Encoding sequential information in semantic space models: Comparing holographic reduced representation and random permutation. Computational intelligence and neuroscience, 2015, 1-18.

Salzmann, Z. (2004). Language, Culture, and Society: An Introduction to Linguistic Anthropology (3rd ed). Westview Press.

Schank, R. C., Goldman, N. M., Rieger III, C. J., & Riesbeck, C. (1973). MARGIE: Memory analysis response generation, and inference on English. In IJCAI, 3, 255-261.

Taraban, R., Marcy, W. M., LaCour Jr., M. S., & Burgess II, R. A. (2017). Developing machine-assisted analysis of engineering students’ ethics course assignments. Proceedings of the American Society of Engineering Education (ASEE) Annual Conference, Columbus, OH. https://www.asee.org/public/conferences/78/papers/19234/view.

Taraban, R., Marcy, W. M., LaCour, M. S., Pashley, D., & Keim, K. (2018). Do engineering students learn ethics from an ethics course? Proceedings of the American Society of Engineering Education – Gulf Southwest (ASEE-GSW) Annual Conference, Austin, TX. http://www.aseegsw18.com/papers.html.

Taraban, R., & Marshall, P. H. (2017). Deep learning and competition in psycholinguistic research. East European Journal of Psycholinguistics, 4(2), 67-74.

Weizenbaum, J. (1966). ELIZA—a computer program for the study of natural language communication between man and machine. Communications of the ACM, 9(1), 36-45.

Winograd, T. (1972). Understanding natural language. New York: Academic Press.

Category: Taraban, R., Koduru, L., LaCour, M. Marshall, P. | Added by: eejpl
Views: 58 | Downloads: 9