本文是将我发在 Steem 上的两篇帖子 Steemit 的新手们,你们考虑过数据隐私的问题吗 和 开发了一个易用的在线小工具 steemr 合在一起重新编写的。读过这两篇的就不必再读了。
Steemit,新生事物,最近因为土木坛子晒钱包,大家看到之后纷纷眼热,于是蜂拥而至。不知用户有没有意识到自己在 Steem 上的个人隐私问题。要知道,用户在 Steem 上的一举一动,都是记录在案的。有人说,在互联网上没有隐私可言,什么都查得到。但是我认为 Steem 不一样。比如我想知道你的微信或 QQ 账号有哪些好友,你最喜欢跟谁聊天,你一般几点钟最活跃,这些数据虽然在腾讯服务器上,但是我获取不了。而 Steem 上的这些数据,是对所有人开放的,只要别人想,就可以得到。
那么,别人到底能看见你的哪些信息呢?
下面,经土木坛子允许,我们用他的账号 @tumutanzi 的公开数据(https://steemdb.com/, 采集时间 2017-07-26 22:06:46 CEST),进行一点粗浅的展示,所用的工具是 R 语言。数据处理得比较仓促,连分析都算不上,如有错误,抱歉得很,请先原谅我,然后指正。
@tumutanzi 的粉丝有 319个ID,按照受欢迎程度(即他们各自被多少ID关注)排序的词云是这样的:
凡是看得清的,基本我都不认识。 @abit 大神在这里并不起眼。
按照发帖数排序是这样的:
@abit 和 @oflyhigh 两位大神似乎也并不显山露水。
按Vest(是什么?富有程度?)排序是这样的:
吓死人了。 @abit,拥有 4077000000 Vests,鹤立鸡群,弄得别的ID都区分不出来了。这叫闷声发大财么?
我们把他踢出去再看看:
@oflyhigh 和 @xiaohui 现身了。
另一方面,我们看看 @tumutanzi 都关注了谁。
@tumutanzi 关注的ID总数是 305(包含后来他取消关注的)。其中,有152个 ID 跟 @tumutanzi 是双向关注的,而153个ID 虽然被他关注,但却没有关注他。奈何明月照沟渠。可怜的 @tumutanzi,这些不搭理他的 ID 是:
## [1] "ace108" "act" "adm"
## [4] "adoal" "adsactly" "alice"
## [7] "anwenbaumeister" "arcurus" "arhag"
## [10] "azazqwe" "batel" "ben"
## [13] "birds90" "bitcube" "blocktrades"
## [16] "blueorgy" "bob" "brothermic"
## [19] "btsabc" "bziing" "candiceji"
## [22] "ceciliali" "chinadaily" "cnfund"
## [25] "coldhair" "cornelia" "cqf"
## [28] "created" "cybercodetwins" "damarth"
## [31] "dan" "dantheman" "databass"
## [34] "davidyuan" "deadlinefarmer" "dek"
## [37] "dixonloveart" "dmilash" "ekitcho"
## [40] "elfkitchen" "engagement" "everittdmickey"
## [43] "feelapi" "flysaga" "freedom"
## [46] "frostfan" "fuzzyvest" "fyrstikken"
## [49] "gaoduzhu" "gavvet" "gefa"
## [52] "glitterfart" "goku1" "guyverckw"
## [55] "haster" "hendrikdegrote" "hingman"
## [58] "holabebe" "huaji" "icedream"
## [61] "incrediblesnow" "ivanba12" "j-bex"
## [64] "jademont" "jaewoocho" "jamesc"
## [67] "jamesc1" "jasonmcz" "jeicko19"
## [70] "jholdsworthy" "jinbo" "kawaiiiiiiii030"
## [73] "kenchung" "kitcat" "krischy"
## [76] "laodr" "laoyao" "lcb"
## [79] "lief" "linuslee0216" "logic"
## [82] "lukestokes" "matrix" "miacats"
## [85] "michael-a" "michael-b" "microluck"
## [88] "minfon" "minority-report" "monkimo"
## [91] "morten" "mottler" "ned"
## [94] "nextgen622" "nicolemoker" "nuagnorab"
## [97] "orientaledu" "papa-pepper" "penguinpablo"
## [100] "peterpan" "pfunk" "pharesim"
## [103] "pilgrimtraveler" "pipertomcat" "pistox"
## [106] "pkhope" "project11" "proskynneo"
## [109] "publicworker" "puffin" "ranchorelaxo"
## [112] "rawnetics" "rayphu" "rea"
## [115] "recursive" "renohq" "riverhead"
## [118] "roadscape" "robrigo" "rok-sivante"
## [121] "rook" "safari" "scorpion"
## [124] "sean1980" "smooth" "someone"
## [127] "stan" "steem" "steemed"
## [130] "steemit" "steemitblog" "stellabelle"
## [133] "str11ngfello" "sunnyray" "susanlo"
## [136] "tamim" "teamsteem" "thecryptofiend"
## [139] "thejohalfiles" "trafalgar" "transisto"
## [142] "val-a" "val-b" "wackou"
## [145] "wanbin" "waterflier" "webclerk"
## [148] "witness.svk" "wjm" "wjs2634"
## [151] "wongshiying" "xeldal" "zoef"
反过来,关注了他却未被他关注的有:
## [1] "abit" "abusalam" "acrich"
## [4] "adraaz" "alexieong" "alfredchen"
## [7] "aniksh" "artsyppl" "asskick"
## [10] "aster" "atc" "atrix"
## [13] "avilsd" "avinger" "babasikandar"
## [16] "baixu1018" "bajeheart" "bassemm"
## [19] "bbkivines" "beautifulbella" "benou"
## [22] "berke" "biddle" "bilalhaider"
## [25] "bindu" "bocaiwen" "brianchen"
## [28] "britt.the.ish" "carinewhy" "carolynseymour"
## [31] "catwomanteresa" "chumhumnews" "cryptochamo"
## [34] "danishali" "danm2017" "davidad"
## [37] "daxiblog" "dcc" "deanliu"
## [40] "diablocro" "diaohuijun" "dimidrolshina"
## [43] "dinsha" "dobro88888888" "dominicmi"
## [46] "dragos0897" "dtworker" "fitinfun"
## [49] "frankbuse" "gamemusic" "gatmi"
## [52] "goaln" "gowldie" "gribtertong"
## [55] "grildrig" "hamzaoui" "harttwohartllc"
## [58] "hebro" "hilarski" "hofffman"
## [61] "hongmeng" "hoof" "hqfzone"
## [64] "imagediet" "imako" "initnas"
## [67] "investlimestone" "iqbalbireuen" "irphotography"
## [70] "ishaq" "istox" "iszarl"
## [73] "itan05" "itissimple" "jackmiller"
## [76] "jamal83" "jason-donovan" "jhonnight"
## [79] "jones420" "joseburgos" "jukas"
## [82] "junyi" "justyy" "kabindra"
## [85] "katythompson" "khairulmuammar" "kishore1988"
## [88] "krasotka" "kristjannacaj" "landeberg"
## [91] "lautenglye" "ledygaga25" "lemooljiang"
## [94] "liangfengyouren" "lifeofkly" "lifishake"
## [97] "lilywang" "liufeng" "luckerme"
## [100] "lykencrypto" "lynx" "marianpancu"
## [103] "mehdioraki3973" "minknsa" "mkotibabu"
## [106] "monalishabiswas" "mydarlings2" "myfirst"
## [109] "mytamilabiz" "natashahall" "nationalpark"
## [112] "nawar93" "newsagg" "notregme"
## [115] "oflyhigh" "ohammedfelahi" "patricksanlin"
## [118] "phyohtetpaing" "powerfj" "qipashuo"
## [121] "redje" "renua" "revolusi"
## [124] "riansteem" "rifattb" "robi8888"
## [127] "rodneyaspiras" "rogerwilson" "sbamsoneu"
## [130] "secora" "securetalk" "sequentialvibe"
## [133] "shaheer001" "shankhunsam" "shengjian"
## [136] "sheval" "showoff" "skynyc"
## [139] "somebody" "stackin" "steem.engine"
## [142] "steemitguide" "sthitaprajna" "suitup"
## [145] "sunnyjolly" "superhardness" "swssmarketing"
## [148] "tastetwist" "themessengers" "toninux"
## [151] "trevis" "vax-rambo" "vishal1088"
## [154] "walkman90" "warrkin" "whydowork"
## [157] "worldwidetravel" "xiaohui" "xiaokongcom"
## [160] "xjb" "yellowkersie" "ygern"
## [163] "yuxi" "zerozero777" "zhijun"
## [166] "zhuqiankun" "zwal"
奇怪, @abit 和 @justyy 明明在 @tumutanzi Steemit 的 following 名单里,但是在 steemdb 里却没有.
Steem 上任何 ID 的类似信息都可以轻易查到。为此,我开发了一个在线工具 steemr,方便那些不懂编程或懒得编程的朋友来查询 Steem 信息。查查自己的,查查别人的,想查谁就查谁吧,出了事儿别找我。
steemr 用起来很简单:
点击
Follower report
标签,输入你想查的 ID,点击GO
按钮,等几秒钟或十几秒钟(取决于服务器的速度和 你要查的 ID 的情况)就可以看到结果了,包括粉丝 (followers) ID 的词云图,按这些粉丝的粉丝数量、发帖数量和vests大小排序。如果粉丝超过30个,就取前三十个。词云图下方,还给出了尚未互粉的 ID 列表。点击
Voter report
标签,输入你想查的帖子链接(可以是完整链接,也可以是从@符号开头到末尾),点击GO
按钮,等几秒钟或十几秒钟,就可以看到为该帖点赞的名单和百分比。图下面给出了尚未投票的粉丝名单(类似的工具有 @justyy 的网页界面 , @beautifulbella 的Excel 方法, @oflyhigh 的Python 方法)。
本工具以后会进一步开发完善。这只是冰山的一角。
再次提醒大家:在 steemit 发文发照片要三思,发布之前多想想,免得后悔。在区块链上,水一旦泼出去,就再也收不回来了。