How we solve your problem
Search data by
Finding datasets complementary to the existing ones is a challenge. Sci2sci tackles this by providing advanced search options where you can upload a dataset and receive results based on the structural similarity to the dataset you provided (names of groups & columns, metadata etc.) Alternatively, you can search from scratch by typing in keywords you are interested in.
E.g.: Alice and Bob are ML researchers who have developed a model that predicts a likelihood of a person being diagnosed with COVID-19 based on the symptoms the person displays. They want to validate it on data from other regions to see whether it scales. Alice drags and drops their original dataset to sci2sci search and finds an almost identical dataset from another research group at the top of the search results.
Aggregate new data automatically
Our API allows to interact with our system programmatically and build custom pipelines on top of our search and sharing functionality, saving users’ time, automating routine tasks and extending basic sci2sci capabilities.
E.g.: Carol is a cybersecurity researcher. She works on a network traffic analysis tool for detection of malicious activity. She uses various data about network traffic to train ML models for anomaly detection. She receives new data from an in-house testing network and also uses data from external sources she finds on sci2sci. Because most of the datasets she uses receive periodical updates, she automated the process of updating external datasets and model retraining.
Stay up-to-date about new research results and datasets
Receive notifications when new data relevant to your topic of interest appears. You can get updates about keywords and/or their combinations, or even specific file types that get uploaded to sci2sci or other databases that sci2sci indexes.
E.g. Dora is a specialist who explores customer behaviour online to upgrade relevance of search result suggestions. The field she is working in is highly dynamic, and she wants to stay up-to-date with the most recent data. Dora subscribes to the datasets relevant to the platforms and industries her company’s clients are interested in to immediately receive all the latest updates whenever they appear.
Share your data from dusty shelfs to get new insights
If you use sci2sci for data sharing, you will receive a notification whenever your data has been mentioned in other works. That allows you to learn from it without investing your time and effort and to benefit yourself while doing public good.
E.g. Eva and Frank work with Carol in the cybersecurity company. Eva was sniffing network traffic from the testing environment with malicious software examples and collected a lot of datasets. She thought that those datasets could be used by other researchers and told about this idea to Frank who is leading the ML department. They decided to share the data using the sci2sci publishing platform. After a while they recleved a notification about a paper that used their dataset. Frank and Eva applied the algorithms described in that paper in their main network analyzer product and improved their users' protection.