Date post: | 29-Jan-2018 |
Category: |
Technology |
Upload: | daniel-kershaw |
View: | 226 times |
Download: | 3 times |
“Language,neverforget,ismorelikefashionthanscience,andmattersofusage,spelling,andpronunciationtendtowanderaroundlikehemlines”- BillBryson,TheMotherTongue:EnglishandHowItGotThatWay
LanguageisinconstantchangeOnlinecommunicationaddsextrapressurethoughthemergingoftimeandspace
– Awesomesauce– Bants– beero’clock– brainfart– Brexit– bruh
LanguageisContentlyChanging
Metcalf’sFudgeFrequencyofthewordUnobtrusivenessoftherodDiversityofusersandsituationsGenerationofotherformsandmeaningsEnduranceoftheconcept
GroundedModels
Barnhart’sVfrgt(V)Numberofforms(F)Frequencyofword(R)Numberofsources(G)Numberofgenera(T)TimeSpanofWord
LinguistsandlexicographersaimtounderstandlanguageDevelopedheuristicstoaidthedecisiontoincludewordsindictionaries
TheData
Twitter Redditu
Users 3,108,844 ≈25,000,000
Posts 73,528,954 ≈500,000,000
Communities 3046 121,373
Words(n>200) 373,217 2,712,629
TimePeriods 283days 880days
uDatagatheredfromtheJune2015Reddit DataReleasehttps://goo.gl/j116ML
VariationinFrequencyAssesschangedinrawfrequencyanduserfrequencyovertime
DiversityinFormAssessusersadoptionofvaryingformse.g.additionsofing
DiversityinMeaningOvertimecanweseeaconvergenceinmeaningoftheword
Measures
AssesstheprefixandsuffixadditionofaninnovationListofprefixandsuffixesfromtheOED
– apple->apples– hero->antihero
DiversityinForm
LookingforinnovationsthathavenotbeenseenbeforeNosolidifiedmeaningwithinexistingsystemse.g.WordNet
Learnstheembedding ofwordswithinacorpususingword2vecDevelopedbyGooglein2009Usesdocumentstotrainneuralnet
DiversityinMeaning- word2vec
DiversityinMeaning- word2vec
Time(t)
Community(c)
w2v
w2v
w2v
w2v
w2v
w2v w2v
w2v
w2v
w2v
w2v
w2v
w2v
w2v
w2v w2v
w2v
w2v
w2v
w2v
w2v
w2v
w2v
w2v w2v
w2v
w2v
w2v
w2v
w2v
w2v
w2v
w2v w2v
w2v
w2v
LookingforstatisticallysignificantgrowthordecayofaninnovationPresumelanguagechangehappensinamonotonicfashionFitSpearman'sranktoeachtimeseries
XvalueisdayssincestartofdataYvalueisnormalizedfrequencyofword
Valuerange-1to1
SamplingtheData
WordoftheYear
Collinsbinge-watch,verbcleaneating,nouncontactless,adjectiveCorbynomics,noundadbod,nounghosting,nounmanspreading,nounshaming,nounswipe,verbTransgender,adjective
Oxford😂
Adblocker,nounBrexit,nounDarkWeb,nounOnfleek,adjective phraseLumberserxual,nounRefugee,nounSharingeconomy,nounThey(singular),pronoun
UsersadopttheirlanguagetothosearoundthemTheyaccommodatetheirlanguageforanumberofreasons,
• Impresspeople• Dominatepeople• Looksophisticated
UsersandLanguage
StructuralHoles
• Thepositionofauserinanetworkgivestheuserdifferentlevelsofpower
• Userwhomspanstructuralholeshaveaccesstomorevariedinformation
• Allowinformationtoflowbetweencommunities,
Howtomeasure
Conceptually,constraintreferstohowmuchroomyouhavetonegotiateorexploitpotentialstructuralholesinyournetwork.
2
÷÷ø
öççè
æ+= å
qqjiqijij pppC
Cij =Directinvestment(Pij)+Indirectinvestment
1
2
4 5
3
Lessconstraint
Method
• Weknowthesizeofthefinaldiffusion(N)• Whatistheaverageateachindividualusage(n)ofaninnovation
• Classify/groupdiffusionsbaseoffinalsize
• Youcouldcreatethenextviralword,• Anewwordhastobe:
– Beusableinmanydifferentcontexts– Usedacrossdifferentcommunities– Nottodifferentfromexistinglanguagesounds
TakeHome