Jason Geller, RutÂgers UniÂverÂsiÂty
@jgeller_phd
Full TranÂscript:
Jason Geller:
So first off, I want to thank Rachel for invitÂing me to speak on this panÂel. For the past couÂple of years, I’ve been an obserÂvant of The BeOnline ConÂferÂence, so it’s realÂly aweÂsome to actuÂalÂly be a parÂticÂiÂpant and talkÂing about audiÂtoÂry research online. So today, I’m going to be talkÂing to you about a project that I startÂed workÂing on when I was a postÂdoc at the UniÂverÂsiÂty of Iowa. And while there, myself and also colÂleagues develÂoped the task called the Iowa Test of ConÂsoÂnant PerÂcepÂtion, and what we did is we tried to valÂiÂdate it. So that is what I’m going to be talkÂing to you about today.
Jason Geller:
So to start, I want you to imagÂine that you’re at a bar, pre panÂdemÂic, and you’re havÂing a conÂverÂsaÂtion. You’re tradÂing conÂverÂsaÂtion back and forth, and while you’re doing this there’s trafÂfic comÂing into your domain, there’s a blarÂing music, and there are othÂer peoÂple talkÂing. So a critÂiÂcal quesÂtion for speech perÂcepÂtion is how are we able to attend to the conÂverÂsaÂtion that we’re havÂing with the peoÂple close to us while ignorÂing all this extraÂneÂous noise that’s also occurÂring conÂcurÂrentÂly while we’re tryÂing to have this conÂverÂsaÂtion? So this is kind of the clasÂsic cockÂtail parÂty problem.
Jason Geller:
So one way that we can kind of assess this speech and noise issue is by using speech and noise tasks. This is what audiÂolÂoÂgists and also labÂoÂraÂtoÂry researchers use, and they come in two flaÂvors, there is an open-set tasks and then there’s closed-set tasks, and underÂneath those dimenÂsions there are sinÂgle word recogÂniÂtion tasks, as well as senÂtence based tasks. And what would a parÂticÂiÂpant see in an open-set task? So let me play you an examÂple of that.
recordÂing:
[crosstalk 00:01:50].
Jason Geller:
So a word or a senÂtence would be spliced into that mulÂti-speakÂer babÂble and indiÂvidÂuÂals would have to kind of search their menÂtal lexÂiÂcon, choose the word they think they heard, and then they have to proÂduce it. And if you weren’t able to hear what that word was in the mulÂti-speakÂer babÂble, it was ball.
Jason Geller:
In conÂtrast, closed-set tasks don’t have a proÂducÂtion eleÂment, the same as the open-set tasks. Instead, they usuÂalÂly have a forced choice task where they’re preÂsentÂed with sevÂerÂal options and they have to choose which one they think it is. And like I said before, it was ball that was interÂspersed into that speech and noise.
Jason Geller:
So genÂerÂalÂly speakÂing, senÂtence-based, open-set tasks are genÂerÂalÂly preÂferred as they’re more repÂreÂsenÂtaÂtive of everyÂday lisÂtenÂing sitÂuÂaÂtions, so they’re more ecoÂlogÂiÂcalÂly valid. HowÂevÂer, open-set tasks are difÂfiÂcult to use experÂiÂmenÂtalÂly, right? So a senÂtence-based open-set task would engage a whole host of processÂes that are not directÂly relatÂed to speech perÂcepÂtion. So as I said before, open-set tasks require proÂducÂtion, so if indiÂvidÂuÂals have a lanÂguage impairÂment such as aphaÂsia, they wouldÂn’t be able to do that task. SenÂtence-based tasks require workÂing memÂoÂry dependÂing on how hard or synÂtacÂtiÂcalÂly comÂplex the senÂtences is, and also it relies on conÂtext. So indiÂvidÂuÂals can use conÂtext to infer maybe upcomÂing words. So again, it’s not directÂly tapÂping speech perception.
Jason Geller:
So what we need is a closed-set task that betÂter approxÂiÂmates everyÂday lisÂtenÂing sitÂuÂaÂtions. So in everyÂday lisÂtenÂing sitÂuÂaÂtions, there’s lexÂiÂcal comÂpeÂtiÂtion, so repÂreÂsenÂtaÂtions are batÂtling each othÂer for selecÂtion, and then there’s also talkÂer variÂabilÂiÂty. So difÂferÂent talkÂers, and also speech might be accentÂed or not, so we have to take that into account. With those goals in mind, we set out to creÂate a task called the Iowa Test of ConÂsoÂnant PerÂcepÂtion that would hopeÂfulÂly meet those goals. This parÂticÂuÂlar task is a four alterÂnaÂtive word choice closed-set task. There’s 120 tarÂget words, and each tarÂget word belongs to a set, and withÂin that set, it appears both as a tarÂget and a foil. We recordÂed each tarÂget word with four speakÂers, so two women, two males, and all of the foils are minÂiÂmal pairs difÂferÂent by the first conÂsoÂnant. And the noise, we use a mulÂti-speakÂer babÂble. So this is an examÂple of the mulÂti-speakÂer babble.
recordÂing:
[crosstalk 00:04:19]
Jason Geller:
What I want to point out here is that all of the analyÂsis scripts, mateÂriÂals and data for the Iowa Test of ConÂsoÂnant PerÂcepÂtion are availÂable at our OSF page, so we’re hopÂing that indiÂvidÂuÂals could use this to repliÂcate our results here or roll their own Iowa Test of ConÂsoÂnant Perception.
Jason Geller:
So, when we startÂed this valÂiÂdaÂtion project, we weren’t in a panÂdemÂic, so data colÂlecÂtion was going pretÂty well. And then the panÂdemÂic hapÂpened and metaphorÂiÂcalÂly speakÂing, peoÂple left the bar. We couldÂn’t have peoÂple in the lab anyÂmore, so we kind of had to decide on an alterÂnaÂtive. And I decidÂed that we should try to valÂiÂdate this online. So as Bob Dylan said, “The times they are a‑changing.” And more and more researchers are putting their experÂiÂments online. And a lot of audiÂtoÂry researchers, as we have heard today are also takÂing the research online. So I thought that it would be perÂfect to try to valÂiÂdate this online.
Jason Geller:
So for the proÂceÂdure, we had two sesÂsions and these were spaced one week apart and we used GorilÂla as our experÂiÂmenÂtal and hostÂing platÂform. And we use ProÂlifÂic as our recruitÂment platform.
Jason Geller:
So in sesÂsion one, we had 199 parÂticÂiÂpants and indiÂvidÂuÂals first did a headÂphone screenÂer. So we used the [Woods At All 00:05:39] headÂphone screenÂer that Rachel talked about. Then after that, they did the Iowa Test of ConÂsoÂnant PerÂcepÂtion, and this was 240 triÂals with two speakÂers. Then after that, they did the ConÂsoÂnant-NucleÂus-ConÂsoÂnant test, which is a hunÂdred words in noise. And the reaÂson why we chose this parÂticÂuÂlar test is because it’s what’s being used in UniÂverÂsiÂty of Iowa HosÂpiÂtals. So we wantÂed to look at corÂreÂlaÂtions between this and anothÂer test.
Jason Geller:
In sesÂsion two, 98 parÂticÂiÂpants returned. The attriÂtion rate is not the greatÂest, but it is what it is. For sesÂsion two, indiÂvidÂuÂals had to comÂplete a headÂphone screenÂer again. Then they were givÂen the Iowa Test of ConÂsoÂnant PerÂcepÂtion again. This is 240 triÂals and we chose two difÂferÂent speakÂers. And the reaÂson why we had two difÂferÂent speakÂers is so there wasÂn’t any learnÂing affects. After this, they did the AZbio, which is just 20 senÂtences in noise. And again, we’re using this AZbio test because it’s what’s being used at the UniÂverÂsiÂty of Iowa hosÂpiÂtals and the clinÂics. Then after this, they did some demographics.
Jason Geller:
So what did the parÂticÂiÂpants actuÂalÂly see? So all of these are availÂable on open mateÂriÂals, so why don’t I just show you? So first, let’s look at the CNC task and what they did.
recordÂing:
[crosstalk 00:06:55] talk [crosstalk 00:06:55].
Jason Geller:
Yeah. So there’s a fixÂaÂtion cross, and then there’s a word interÂspersed in that noise and you just have to type in what you thought you heard.
recordÂing:
[crosstalk 00:07:05] cake [crosstalk 00:07:07].
Jason Geller:
Again. And the AZbio is very simÂiÂlar, but instead of a word, there’s a senÂtence and they had to type out the senÂtence that they thought that they heard. For the ITCP, which we have a code name for, is isn’t, and this is very similar.
recordÂing:
[crosstalk 00:07:25]
Jason Geller:
So they hear the word and noise and then there’s four choicÂes for them to choose from. And this is the pracÂtice triÂal, so there’s feedÂback, but they would pick maybe that they heard gone, and that’s incorÂrect. So, that is what these tasks look like online.
Jason Geller:
Okay, so back to the preÂsenÂtaÂtion. So before I get into the valÂiÂdaÂtion piece, what we wantÂed to do was pilot the stimÂuli. So what we did is we ran a study with 50 parÂticÂiÂpants and we assessed all of these words just in silence so we could get kind of a overÂall intelÂliÂgiÂbilÂiÂty of these stimÂuli. And overÂall accuÂraÂcy was about 95%, so that’s good.
Jason Geller:
Now let’s get into the valÂiÂdaÂtion piece. So what we realÂly wantÂed to know was, what is the reliÂaÂbilÂiÂty of the ITCP? And we did this by lookÂing at test-retest. So we had indiÂvidÂuÂals come in durÂing sesÂsion one to do the ITCP and then a week latÂer they did the ITCP again. So using the inter-class corÂreÂlaÂtion, which is a meaÂsure of agreeÂment, we get high reliÂaÂbilÂiÂty. So 0.8, which is good. And this is kind of just a scatÂter cloud of sesÂsion one of the ITCP and sesÂsion two of the ITCP, and we can see that there’s kind of this posÂiÂtive large correlation.
Jason Geller:
We were also interÂestÂed in just lookÂing at how the ITCP corÂreÂlates with the othÂer tasks that we had them do. So for this, we looked at sesÂsion one of the ITCP and the CNC and what we observed is a corÂreÂlaÂtion of 0.54, and this is actuÂalÂly a robust meaÂsure of corÂreÂlaÂtion, so it’s perÂcentÂage bend, which takes into account some of these outÂliers. And we get a corÂreÂlaÂtion of about 0.54. While it’s posÂiÂtive and fairÂly large by conÂvenÂtionÂal stanÂdards, it’s not realÂly psyÂchoÂmeÂtÂriÂcalÂly where we wantÂed to be, which is unfortunate.
Jason Geller:
And then we also did the same thing for AZbio. So again, we see the scatÂter plot here. We see that there’s a posÂiÂtive corÂreÂlaÂtion and it’s fairÂly large, so it’s 0.59. But again, it’s not where we want it psychometrically.
Jason Geller:
In addiÂtion to this valÂiÂdaÂtion piece, we also did some exploratoÂry work where we looked at how things like talkÂer and vowÂel conÂtext and manÂner and place affect accuÂraÂcy. And unforÂtuÂnateÂly, I can’t talk about that research today, but what I do want to talk a litÂtle bit about is kind of this IRT one paraÂmeÂter Rausch modÂel that we fit, which we extractÂed all of the item easÂiÂness estiÂmates from. So we can see here. So the palÂlet is not as nice as VioÂlet’s, but I still like the palÂlets here. And we can see that all these items kind of fall withÂin kind of the sweet spot of one to negÂaÂtive one. So there’s not realÂly items that are too hard or too easy, which is someÂthing that we want. And I want to stress this, that we wantÂed to proÂvide someÂthing like this so researchers could use this, and roll their own ITCP, so maybe exclude or include cerÂtain items. So hopeÂfulÂly that will be useÂful to folks that want to do some of the speech and noise work.
Jason Geller:
So to kind of sum up, we see that the ITCP is highÂly reliÂable. So we had an ICC of about 0.8. The validÂiÂty meaÂsures, I think that there that’s an open quesÂtion and I think we need to do more work. So as kind of next steps, we want to look at the valÂiÂdaÂtion in the lab. As I menÂtioned earÂliÂer, we were already startÂing to valÂiÂdate this in the lab and then we had to stop doing that. But the data looks pretÂty good and it’s pretÂty comÂpaÂraÂble from what we’re observÂing online, so that’s what we want to see.
Jason Geller:
One thing that I would be realÂly interÂestÂed in lookÂing at is doing a valÂiÂdaÂtion of this study with indiÂvidÂuÂals with hearÂing impairÂment, so hearÂing aid users and cochlear implant users. I think that’d be realÂly interÂestÂing if we can actuÂalÂly have them stay home, they don’t have to come into the clinÂic, and they can just do this task online and we can use their inforÂmaÂtion like that.
Jason Geller:
And then lastÂly, we want to use this experÂiÂmenÂtalÂly. So we want to do eye trackÂing research, EEG and PET research. And that’s all being planned out right now at the UniÂverÂsiÂty of Iowa. So we’re realÂly lookÂing forÂward to the results that are going to come out from this.
Jason Geller:
So, I want to end this by givÂing some advice that I wish I had when I first startÂed these mulÂti-day studÂies. So, it’s realÂly, realÂly hard to do these mulÂti-day studÂies. There’s lots of attriÂtion. So I wish I would’ve known of these things going into it, which I did not. So one kind of piece of advice is to give bonusÂes for comÂpletÂing the secÂond sesÂsion. So you need to set up sepÂaÂrate studÂies on your recruitÂment platÂform and then just offer bonusÂes for them to finÂish the secÂond task. I think that realÂly incenÂtivizes folks to come back for the secÂond test. I first did this with just havÂing everyÂthing as one sesÂsion and it endÂed horÂriÂbly. There was lots of peoÂple takÂing it and not comÂing back for the secÂond sesÂsion, so that realÂly hurt my numbers.
Jason Geller:
It’s very imporÂtant that you’re explicÂit in your study descripÂtion. So you need to lay out exactÂly what you want the parÂticÂiÂpants to do. And also, so there’s no ambiÂguÂiÂty when parÂticÂiÂpants email you and say that there was some issues with the experÂiÂment or they didÂn’t do the secÂond part, or can I do the secÂond part? You just need to be explicÂit. Very imporÂtant is to email subÂjects mulÂtiÂple times to remind them of an upcomÂing sesÂsion. I don’t know if ProÂlifÂic fixed this, but it was very hard to just let parÂticÂiÂpants that you wantÂed to email sepÂaÂrateÂly. You had to email everyÂone that parÂticÂiÂpatÂed in your study, which is not ideal.
Jason Geller:
And then lastÂly, just try to make your experÂiÂment a reaÂsonÂable length. So for this parÂticÂuÂlar project, each sesÂsion took about 40 minÂutes and realÂly that’s not ideÂal. You want to make sure that it’s manÂageÂable for them to comÂplete, and they’re not bored, or they don’t lose motiÂvaÂtion. So maybe if I had to do this again, and I probÂaÂbly wouldÂn’t have it be so long or I’d spread it out over mulÂtiÂple days so it’s in a reaÂsonÂable length. So that’s kind of my advice or things that I wish I knew when I first startÂed this mulÂti-day experÂiÂments. And with that, thank you. And I look forÂward to your questions.
SpeakÂer 3:
That was fanÂtasÂtic, Jason, thank you so much. As always with your work, I’m just impressed with such top-notch empirÂiÂcal methÂods, and what a deep comÂmitÂment to open mateÂriÂals as well. It’s just wonÂderÂful. We might have time for one quick quesÂtion. Again, we can also use the chat and the Q&A forum and time and GathÂer Town.
SpeakÂer 3:
Okay. ChristiÂna, you can share the slides. One thing that struck me durÂing your talk, Jason, and someÂthing that I think all of us say, we talk about valÂiÂdatÂing what we see online with what we see in the lab and to some degree, I think it’s interÂestÂing that that isn’t reversed. That we’re not kind of reframÂing the narÂraÂtive that why shouldÂn’t we be valÂiÂdatÂing what we see in the lab to a bit more natÂurÂal enviÂronÂment? RealÂly great work.


