We interact with others even without utterances, because observation of others’bod-
ily motions often enables guessing their intentions. Others’bodily motions serve as a
major resource for mutual action and collaboration. In this article, we have analyzed,
qualitatively with multimodal and fine-grained transcripts, “implicit collaborations”
that are constituted by not only utterances but also bodily motions, and have revealed
the way people organize those. The example domain we have selected is a table cooking
of “monja-yaki”, because implicit collaboration occurs frequently during cooking. Our
findings are the following: (1) reading appropriate timing based on observations of each
other’s bodily motions made smooth transitions of cooking phases successful, and (2)
even when the current speaker, addressing a certain hearer, asked a question and the
hearer did not make an oral reply, the lack of reply did not cause any problem in their
communication if the hearer intended to reply by his or her bodily motions, and the
speaker properly attended to those. This way, communication and/or collaboration
hold even without oral turn-taking.