![]() |
steven t. piantadosi spiantado@gmail.com @spiantado.bsky.social google scholar github lab cv |
I am a professor at UC Berkeley in psychology and the Helen Wills Neuroscience Institute, where I head the computation and
language lab (colala). I am an NCSE Steve who really likes free software, wikipedia, and accordions. I also sometimes teach math at Mt. Tamalpais College. My research uses formal computational methods and behavioral experiments to study how people learn language and create conceptual systems. You can read about some of my work
on information and language, language acquisition, ambiguity, and the evolution of human-like
cognition.
I am a co-founder of Science Homecoming, which encourages scientists to reach out to their hometown newspapers to explain the importance of NIH and NSF.
My partner, Meghan Dorrian, is an architect at Young America Creative.
I was one of nine faculty and students who pursed an EEOC complaint and lawsuit against the University of Rochester over sexual harassment[1][2][3] and retaliation. After the president of the university resigned over the case, the lawsuit was settled in 2020 and the university's new administration thanked the plaintiffs for their efforts.
Diverse mathematical knowledge among indigenous Amazonians.
One model for the learning of language.
The communicative function of ambiguity in language.
Zipf’s word frequency law in natural language: A critical review and future directions.
Bootstrapping in a language of thought: A formal model of numerical concept learning.
A unified account of numerosity perception.
All data and code from papers published and in progress is available upon request.
I develop several free libraries for research in cognitive science:
2025 | |
[141] | The End of Radical Concept Nativism ( ), In , 2025. |
2024 | |
[140] | Reliable Reasoning Beyond Natural Language ( ), In arXiv, 2024. |
[139] | Uniquely human intelligence arose from expanded information capacity ( ), In Nature Reviews Psychology, 2024. |
[138] | Limited information-processing capacity in vision explains number psychophysics ( ), In Psychological Review, 2024. |
[137] | Response to difficulty drives variation in IQ test performance ( ), In Open Mind, volume 8, 2024. |
[136] | Language is primarily a tool for communication rather than thought ( ), In Nature, 2024. |
[135] | Continuous and Discrete Proportions Elicit Different Cognitive Strategies ( ), In Cognition, 2024. |
[134] | Formalising the role of behaviour in neuroscience ( ), In European Journal of Neuroscience, 2024. |
[133] | Why concepts are (probably) vectors ( ), In Trends in Cognitive Sciences, 2024. |
[132] | Modern language models refute Chomsky's approach to language ( ), Chapter in From fieldwork to linguistic theory: A tribute to Dan Everett (Empirically Oriented Theoretical Morphology and Syntax 15) (Edward Gibson, Moshe Poliak, eds.), Berlin: Language Science Press, 2024. |
[131] | Symbolic metaprogram search improves learning efficiency and explains rule learning in humans ( ), In Nature Communications, 2024. |
2023 | |
[130] | Language processing and language learning ( ), Chapter in Bayesian Models of Cognition: Reverse Engineering the Mind (Thomas L. Griffiths, Nick Chater, Joshua Tenenbaum, eds.), 2023. |
[129] | Origins of Hierarchical Logical Reasoning ( ), In Cognitive Science, volume 47, 2023. |
[128] | Cognitive Mechanisms Underlying Recursive Pattern Processing in Human Adults ( ), In Cognitive Science, volume 47, 2023. |
[127] | The Plausibility of Sampling as an Algorithmic Theory of Sentence Processing ( ), In Open Mind, volume 7, 2023. |
[126] | Latent diversity in conceptual representation ( ), In Open Mind, volume 7, 2023. |
[125] | Diverse mathematical knowledge among indigenous Amazonians ( ), In Proceedings of the National Academy of Sciences, volume 120, 2023. |
[124] | Trans-inclusive gender categories are cognitively natural ( ), In Nature Human Behavior, volume 7, 2023. |
[123] | The Algorithmic Origins of Counting ( ), In Child Development, volume 94, 2023. |
[122] | How to enumerate trees from a context-free grammar ( ), In arXiv, 2023. |
[121] | Learning as Bayesian inference over programs ( ), Chapter in Bayesian Models of Cognition: Reverse Engineering the Mind (Thomas L. Griffiths, Nick Chater, Joshua Tenenbaum, eds.), 2023. |
[120] | No clear evidence for a left-to-right mental number line in insects ( ), In Proceedings of the National Academy of Sciences (commentary), 2023. |
[119] | Real-time pragmatic inference across cultures: evidence from a non-industrialized society ( ), In Journal of Experimental Psychology: General, volume 152, 2023. |
[118] | Sampling in Approximate Number Perception ( ), In Proceedings of the Annual Meeting of the Cognitive Science Society, 2023. |
[117] | Children’s Estimation of Peripheral Information Drives Improvements in Approximate Number Sense ( ), In Proceedings of the Annual Meeting of the Cognitive Science Society, 2023. |
2022 | |
[116] | Culture and Commutativity ( ), In Proceedings of the Annual Meeting of the Cognitive Science Society, 2022. |
[115] | Verbal counting and the timing of number acquisition in an indigenous Amazonian group ( ), In PLOS ONE, 2022. |
[114] | Stochastic time-series analyses highlight the day-to-day dynamics of lexical frequencies ( ), In Cognitive Science, volume 46, 2022. |
[113] | Investigating Adults’ Strategy Use During Proportional Comparison ( ), In Proceedings of the Annual Meeting of the Cognitive Science Society, 2022. |
[112] | Reply to Kodner et al: Fundamental misunderstanding of both model and methods ( ), In Proceedings of the National Academy of Sciences (response to commentary), 2022. |
[111] | Meaning without reference in large language models ( ), In arXiv preprint arXiv:2208.02957, 2022. |
[110] | Reply to Murphy et al: Program induction can learn language ( ), In Proceedings of the National Academy of Sciences (response to commentary), 2022. |
[109] | Exact number concepts are limited to the verbal count range ( ), In Psychological Science, volume 33, 2022. |
[108] | Different reference frames on different axes: Space and language in indigenous Amazonians ( ), In Science Advances, volume 8, 2022. |
[107] | Learning as programming: Efficient search in models of human concept learning ( ), In Proceedings of the Cognitive Science Society, 2022. |
[106] | Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models ( ), In arXiv preprint arXiv:2206.04615, 2022. |
[105] | One model for the learning of language ( ), In Proceedings of the National Academy of Sciences, volume 119, 2022. |
2021 | |
[104] | The evolution of quantitative sensitivity ( ), In Philosophical Transactions of the Royal Society B, volume 377, 2021. |
[103] | The psychophysics of number arise from resource-limited spatial memory ( ), In Proceedings of the Cognitive Science Society, 2021. |
[102] | The Natural Stories Corpus ( ), In Language Resources and Evaluation, volume 55, 2021. |
[101] | Logical Word Learning: The case of kinship ( ), In Psychonomic Bulletin & Review, 2021. |
[100] | The Cultural Origins of Symbolic Number ( ), In Psychological Review, volume 129, 2021. |
[99] | Probability, Belief, and the Richness of Cognition ( ), Chapter in The Cognitive Science of Belief (Julien Musolino, Joseph Sommer, Pernille Hemmer, eds.), Cambridge University Press, 2021. |
[98] | The computational origin of representation ( ), In Minds and Machines, volume 31, 2021. |
[97] | Spatial concepts of number, size, and time in an indigenous culture ( ), In Science Advances, volume 7, 2021. |
[96] | Variation in spatial concepts: Different frames of reference on different axes ( ), In Proceedings of the Cognitive Science Society, 2021. |
[95] | Uncontrolled corpus composition drives an apparent surge in cognitive distortions (Commentary on Bollen et al.) ( ), In Proceedings of the National Academy of Sciences (commentary), 2021. |
2020 | |
[94] | A unified account of numerosity perception ( ), In Nature Human Behavior, volume 4, 2020. |
[93] | Recursive sequence generation in monkeys, children, US adults, and native Amazonians ( ), In Science Advances, volume 6, 2020. |
[92] | Simple models of sequential processing cannot explain center-embedded generalizations ( ), In Science Advances eLetters, 2020. |
[91] | A model of temporal connective acquisition ( ), In Proceedings of the Cognitive Science Society, 2020. |
[90] | People Infer Recursive Visual Concepts from Just a Few Examples ( ), In Computational Brain and Behavior, volume 3, 2020. |
[89] | Composition is the core driver of the language-selective network ( ), In Neurobiology of Language, volume 1, 2020. |
[88] | Multi-directional mappings in the minds of the Tsimane': Size, time, and number on three spatial axes ( ), In Proceedings of the Cognitive Science Society, 2020. |
[87] | The Child as Hacker ( ), In Trends in Cognitive Science, volume 24, 2020. |
[86] | The neural basis of predictive pursuit ( ), In Nature Neuroscience, volume 23, 2020. |
2019 | |
[85] | Intrinsic whole number bias in an indigenous population ( ), In Proceedings of the Cognitive Science Society, 2019. |
[84] | A primarily serial, foveal accumulator underlies approximate numerical estimation ( ), In Proceedings of the National Academy of Sciences, volume 116, 2019. |
[83] | Why we should abandon the Semantic Subset Principle ( ), In Language Learning and Development, volume 15, 2019. |
[82] | How Efficiency Shapes Human Language ( ), In Trends in Cognitive Science, volume 23, 2019. |
[81] | One-to-one correspondence without Language ( ), In Royal Society Open Science, 2019. |
[80] | Humans store about 1.5 megabytes of information during language acquisition ( ), In Royal Society Open Science, 2019. |
2018 | |
[79] | A threshold free model of number comparison ( ), In PLOS ONE, 2018. |
[78] | Intrinsic whole number bias in humans ( ), In Journal of Experimental Psychology: Human Perception and Performance, volume 44, 2018. |
[77] | Robust mixture modeling reveals category-free selectivity in reward region neuronal ensembles ( ), In Journal of Neurophysiology, volume 119, 2018. |
[76] | Birth season and height among girls and boys below 12 years of age: Lasting effects and catch-up growth among native Amazonians in Bolivia ( ), In Annals of Human Biology, volume 45, 2018. |
[75] | Word forms are structured for efficient use ( ), In Cognitive Science, volume 42, 2018. |
[74] | Certainty is Primarily Determined by Past Performance during Concept Learning ( ), In Open Mind, volume 2, 2018. |
[73] | Adults use gradient similarity information in compositional rules ( ), In Proceedings of the Cognitive Science Society, 2018. |
[72] | Limits on Composition of Conceptual Operations in 9-Month-Olds ( ), In Infancy, volume 23, 2018. |
[71] | One parameter is always enough ( ), In AIP Advances, volume 8, 2018. |
[70] | Learning list concepts through program induction ( ), In Proceedings of the Cognitive Science Society, 2018. |
[69] | Child stunting is associated with weaker human capital among native Amazonians ( ), In American Journal of Human Biology, volume 30, 2018. |
2017 | |
[68] | Beyond Reward Prediction Errors: Human Striatum Updates Rule Values During Learning ( ), In Cerebral Cortex, volume 28, 2017. |
[67] | Knowledge transfer in a probabilistic Language of Thought ( ), In Proceedings of the Cognitive Science Society, 2017. |
[66] | Wordform similarity increases with semantic similarity: an analysis of 100 languages ( ), In Cognitive Science, volume 41, 2017. |
[65] | Words cluster phonetically beyond phonotactic regularities ( ), In Cognition, volume 163, 2017. |
[64] | Universal and uniquely human factors in spontaneous number perception ( ), In Nature Communications, volume 8, 2017. |
[63] | Color naming across languages reflects color use ( ), In Proceedings of the National Academy of Sciences, National Acad Sciences, volume 114, 2017. |
[62] | Post Hoc Analysis Decisions Drive the Reported Reading Time Effects in Hackl, Koster-Hale & Varvoutis (2012) ( ), In Journal of Semantics, volume 34, 2017. |
[61] | The use of a computer display exaggerates the connection between exact and approximate number ability in remote populations ( ), In Open Mind, volume 1, 2017. |
[60] | How data drives early word learning: A cross-linguistic waiting time analysis ( ), In Open Mind, volume 1, 2017. |
[59] | An incremental information-theoretic buffer supports sentence processing ( ), In Proceedings of the Cognitive Science Society, 2017. |
[58] | A Rational Constructivist Account of the Characteristic-to-Defining Shift ( ), In Proceedings of the Cognitive Science Society, 2017. |
[57] | Learning abstract visual concepts via probabilistic program induction in a Language of Thought ( ), In Cognition, volume 168, 2017. |
[56] | True Numerical Cognition in the Wild ( ), In Psychological Science, volume 28, 2017. |
2016 | |
[55] | Inferring priors in compositional cognitive models ( ), In Proceedings of the Cognitive Science Society, 2016. |
[54] | A large dataset of generalization patterns in the number game ( ), In Journal of Open Psychology Data, volume 4, 2016. |
[53] | A Corpus Investigation of Syntactic Embedding in Pirahã ( ), In PLOS ONE, 2016. |
[52] | Mastery of the logic of natural numbers is not the result of mastery of counting: Evidence from late counters ( ), In Developmental Science, volume 20, 2016. |
[51] | Native Amazonian Children Forego Egalitarianism When They Learn to Count ( ), In Developmental Science, volume 19, 2016. |
[50] | What determines human certainty? ( ), In Proceedings of the Cognitive Science Society, 2016. |
[49] | A Hierarchical Probabilistic Language-of-Thought Model of Human Visual Concept Learning ( ), In Proceedings of the Cognitive Science Society, 2016. |
[48] | Infinitely productive language can arise from chance under communicative pressure ( ), In Journal of Language Evolution, volume 2, 2016. |
[47] | Compositional reasoning in early childhood ( ), In PLOS ONE, 2016. |
[46] | Efficient estimation of Weber's W ( ), In Behavior Research Methods, volume 48, 2016. |
[45] | Endogenous or exogenous? The data don’t say (Commentary on Han, Musolino, & Lidz 2016) ( ), In Proceedings of the National Academy of Sciences, volume 113, 2016. |
[44] | Extraordinary intelligence and the care of infants ( ), In Proceedings of the National Academy of Sciences, volume 113, 2016. |
[43] | Four problems solved by the probabilistic Language of Thought ( ), In Current Directions in Psychological Science, volume 25, 2016. |
[42] | The logical primitives of thought: Empirical foundations for compositional cognitive models ( ), In Psychological Review, volume 123, 2016. |
[41] | A rational analysis of the approximate number system ( ), In Psychonomic Bulletin and Review, 2016. |
2015 | |
[40] | Cognition in reach: continuous statistical inference in optimal motor planning ( ), In Proceedings of the Cognitive Science Society, 2015. |
[39] | The origins of counting algorithms ( ), In Psychological Science, volume 26, 2015. |
[38] | A pragmatic account of complexity in definite Antecedent-Contained-Deletion relative clauses ( ), In Journal of Semantics, volume 32, 2015. |
[37] | Inferring the Tsimane's use of color categories from recognition memory ( ), In Proceedings of the Cognitive Science Society, 2015. |
[36] | The perceptual foundation of linguistic context ( ), In Proceedings of the Cognitive Science Society, 2015. |
[35] | Towards semantically rich and recursive word learning models ( ), In Proceedings of the Cognitive Science Society, 2015. |
[34] | The dynamics of idealized attention in complex learning environments ( ), In The 5th Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics, 2015. |
[33] | Utility-free models of binomial choice can replicate predictions of utility models in many conditions ( ), In Frontiers in Neuroscience, 2015. |
[32] | Problems in the philosophy of mathematics: A view from cognitive science ( ), Chapter in Mathematics, Substance and Surmise: Views on the Meaning and Ontology of Mathematics (Ernest Davis, Philip J. Davis, eds.), Springer, 2015. |
[31] | Response: "Commentary: Utility-free heuristic models of two-option choice can mimic predictions of utility-stage models under many conditions" ( ), In Frontiers in Neuroscience, volume 9, 2015. |
2014 | |
[30] | The Goldilocks Effect in Infant Auditory Attention ( ), In Child Development, volume 85, 2014. |
[29] | Children's learning of number words in an indigenous farming-foraging group ( ), In Developmental Science, volume 17, 2014. |
[28] | Quantitative Standards for Absolute Linguistic Universals ( ), In Cognitive Science, volume 38, 2014. |
[27] | Rich analysis and rational models: Inferring individual behavior from infant looking data ( ), In Developmental Science, volume 17, 2014. |
[26] | Zipf’s word frequency law in natural language: A critical review and future directions ( ), In Psychonomic Bulletin & Review, Springer US, volume 21, 2014. |
2013 | |
[25] | Quantitative methods in syntax / semantics research: A response to Sprouse & Almeida ( ), In Language and Cognitive Processes, volume 28, 2013. |
[24] | The rational integration of noise and prior semantic expectation: Evidence for a noisy-channel model of sentence interpretation ( ), In Proceedings of the National Academy of Sciences, volume 11, 2013. |
[23] | Information content versus word length in natural language: A reply to Ferrer-i-Cancho and Moscoso del Prado Martin [arXiv:1209.1751] ( ), In ArXiv e-prints, 2013. |
[22] | Put your money where your mouth is: Incentivizing the Truth by Making Nonreplicability Costly ( ), In European Journal of Personality, 2013. |
2012 | |
[21] | The interaction of syntactic and lexical information sources in language processing: The case of the noun–verb ambiguity ( ), In Journal of Cognitive Science, 2012. |
[20] | Processing Relative Clauses in Supportive Contexts ( ), In Cognitive Science, Wiley Online Library, volume 36, 2012. |
[19] | A noisy-channel account of crosslinguistic word order variation ( ), In Psychological Science, volume 24, 2012. |
[18] | The Goldilocks Effect: Human Infants Allocate Attention to Visual Sequences That Are Neither Too Simple Nor Too Complex ( ), In PLoS ONE, 2012. |
[17] | Info/information theory: speakers actively choose shorter words in predictable contexts ( ), In Cognition, volume 126, 2012. |
[16] | Bootstrapping in a language of thought: a formal model of numerical concept learning ( ), In Cognition, volume 123, 2012. |
[15] | A corpus analysis of Pirahã grammar: An investigation of recursion ( ), In Talk presented at the LSA (by E. Gibson)., 2012. |
2011 | |
[14] | Using Mechanical Turk to Obtain and Analyze English Acceptability Judgments ( ), In Language and Linguistics Compass, Wiley Online Library, volume 5, 2011. |
[13] | The communicative function of ambiguity in language ( ), In Cognition, volume 122, 2011. |
[12] | Learning and the language of thought ( ), PhD thesis, MIT, 2011. |
[11] | Reply to Reilly and Kean: Clarifications on word length and information content ( ), In Proceedings of the National Academy of Sciences (response to commentary), National Acad Sciences, volume 108, 2011. |
[10] | Word lengths are optimized for efficient communication ( ), In Proceedings of the National Academy of Sciences, National Acad Sciences, volume 108, 2011. |
2010 | |
[9] | The Goldilocks Effect: Infants' preference for visual stimuli that are neither too predictable nor too surprising ( ), In Proceedings of the Cognitive Science Society, 2010. |
[8] | Beyond Boolean logic: exploring representation languages for learning complex concepts ( ), In Proceedings of the Cognitive Science Society, 2010. |
[7] | How the Dimension of Space Affects the Products of Pre-Biotic Evolution: The Spatial Population Dynamics of Structural Complexity and The Emergence of Membranes ( ), In Santa Fe Institute Working Paper arXiv:1010.5019, 2010. |
2009 | |
[6] | The communicative lexicon hypothesis ( ), In Proceedings of the Cognitive Science Society, 2009. |
[5] | Refer efficiently: Use less informative expressions for more predictable meanings ( ), In Proceedings of the workshop on the production of referring expressions: Bridging the gap between computational and empirical approaches to reference, 2009. |
2008 | |
[4] | A Bayesian model of the acquisition of compositional semantics ( ), In Proceedings of the Cognitive Science Society, 2008. |
[3] | Symbolic dynamics on free groups ( ), In Discrete and Continuous Dynamical Systems, volume 20, 2008. |
under review | |
[2] | Indigenous Amazonians spontaneously use space to offload cognitive demands ( ), In , under review. |
[1] | Algorithm induction in indigenous Amazonian children ( ), In , under review. |
My thesis studied learning in language of thought models, in which learners compose simple functions in order to express complex concepts like those needed for natural language. A precis is available here.