Article From:https://www.cnblogs.com/CaiZiYang/p/9684121.html

Prototype model

Prototype design tool: Axure RP 8

  • Prototype home page:

    Function description:

    • Select File: select a paper list file from the local disk to use as the query.
    • Retrieve button / Update Data button: after a file has been selected, clicking the Retrieve button displays the crawled data in the text box below.
    • Hot Word Statistics button / Hot Word Trend button: once data has been retrieved, these buttons show the hot words and the hot-word trend over the past three years, respectively.

Contents

1. Background

Xiao Ying is a junior who has long been hooked on battle-royale ("chicken dinner") games. One day he heard that a roommate had just attended the CVPR conference with his advisor. Envious, he resolved to turn over a new leaf and study hard, hoping to complete an outstanding, cutting-edge paper before graduation. What troubles him is that he does not know the hot areas and research directions of the top conferences in recent years, and going through the paper lists to find and summarize articles one by one is far too inefficient. So he turned to the “Soft Work Practice Mutual Aid Organization”, hoping that we could help him design a platform that meets his current needs.

2. Design objectives

  • 2.1 Requirement analysis
    • The user provides a list of papers.
      • 1. The title, abstract and link of each paper must be crawled from the paper list.
      • 2. Paper lists can be added and removed (this year, the past two years, the past three years).
    • The crawled information is structured and analyzed to find the TOP10 hot areas or popular research directions.
      • 1. Papers can be filtered and analyzed by presentation type (oral, spotlight, poster).
      • 2. Results are presented in an intuitive form such as a hot-word map.
    • Papers can be retrieved. When the user enters basic information such as the paper number, title or author, the relevant paper, source code, home page and other information are looked up and returned.
    • Hot-word popularity trends can be compared across years and across top conferences (limited here to the three top computer vision conferences: CVPR, ICCV, ECCV).
    • Statistics can be computed, such as accepted papers per country, accepted papers per school, and which schools are stronger in which research directions.
    • Additional requirements:
      Provided that the design concept is unchanged and the result still fits users' habits, we may extend and upgrade the requirements above, or add our own ideas to the prototype.
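The year-by-year, conference-by-conference hot-word comparison required above could be computed roughly as follows. This is only a sketch: the `TitleRecord` struct and the `hot_word_trend` function are hypothetical names introduced for illustration, and words are split on whitespace only, so punctuation attached to a word is not stripped.

```cpp
#include <cctype>
#include <map>
#include <sstream>
#include <string>
#include <utility>
#include <vector>

// Hypothetical minimal record: only the fields the trend needs.
struct TitleRecord {
    std::string conference;  // e.g. "CVPR", "ICCV" or "ECCV"
    int year;
    std::string title;
};

// Returns a lower-case copy of s.
std::string to_lower(std::string s) {
    for (char& c : s) {
        c = static_cast<char>(std::tolower(static_cast<unsigned char>(c)));
    }
    return s;
}

// For each (conference, year), counts the titles that contain `word`
// as a whitespace-separated word (case-insensitive).
std::map<std::pair<std::string, int>, int> hot_word_trend(
        const std::vector<TitleRecord>& records, const std::string& word) {
    const std::string target = to_lower(word);
    std::map<std::pair<std::string, int>, int> trend;
    for (const auto& record : records) {
        std::istringstream words(to_lower(record.title));
        std::string w;
        while (words >> w) {
            if (w == target) {
                ++trend[std::make_pair(record.conference, record.year)];
                break;  // count each title at most once
            }
        }
    }
    return trend;
}
```

Because the map key is the (conference, year) pair, iterating over the result visits each conference's years in ascending order, which is the order a trend chart would plot them in.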
  • 2.2 Performance index
    • Judging from the requirements, most of the time is spent crawling the paper information and storing it in structured form. The user's target volume is about 9000 papers (three years of papers from the three top conferences, roughly 1000 papers per conference per year). A paper title is at most 200 characters, so the titles total at most 9000 × 200 = 1.8 million characters, i.e. roughly 2 million (200W).
    • 1. Reading and storing the titles of all papers should complete within 5 s.
    • 2. (Crawl time still to be measured.)
    • 3. Searching for papers and their corresponding information, and generating the hot-word map, should take less than 1 s.
    • 4. Added feature: papers can be searched by paper number and by keyword.

3. Module design

  • 3.1 Module flow chart and explanation
    • The design and implementation process is as follows:
      • Crawl the paper information (paper number, author, link to the original text, etc.) from the web pages in the paper list given by the user.
      • Store the crawled paper information using C++ data structures and a map<string, class> container.
      • Perform data retrieval, hot-word statistics (including retrieval of related papers by keyword) and data analysis through the map container.
      • Display the statistical results as images.
  • 3.2 Data structure specification
//Pseudo code
Paper class:
{
    Properties:
        Paper ID;
        Paper title;
        Paper abstract;
        Paper year;
        Paper author;
        Paper attributes (oral, spotlight, poster);
        Keywords (used for word-frequency statistics);
        Type (indicating which conference the paper belongs to).
    Member functions:
        Get the values of the class attributes;
};
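One possible C++ rendering of the pseudo code above, together with the map<string, class> storage described in 3.1, is sketched below. All names (`Paper`, `Presentation`, `PaperStore`, `add_paper`, `find_paper`, `remove_paper`) are assumptions made for illustration, and a plain struct with public fields stands in for the class with getter functions.

```cpp
#include <map>
#include <string>
#include <utility>
#include <vector>

// Presentation type of a paper, as listed in the requirements.
enum class Presentation { Oral, Spotlight, Poster };

// Hypothetical record mirroring the properties in the pseudo code.
struct Paper {
    std::string id;
    std::string title;
    std::string abstract_text;
    int year;
    std::vector<std::string> authors;
    Presentation presentation;
    std::vector<std::string> keywords;  // used for word-frequency statistics
    std::string conference;             // which conference the paper belongs to
};

// Papers keyed by paper ID.
using PaperStore = std::map<std::string, Paper>;

// Inserts a paper; returns false if a paper with the same ID already exists.
bool add_paper(PaperStore& store, const Paper& paper) {
    return store.insert(std::make_pair(paper.id, paper)).second;
}

// Returns a pointer to the stored paper, or nullptr if the ID is unknown.
const Paper* find_paper(const PaperStore& store, const std::string& id) {
    auto it = store.find(id);
    return it == store.end() ? nullptr : &it->second;
}

// Removes a paper by ID; returns true if something was removed.
bool remove_paper(PaperStore& store, const std::string& id) {
    return store.erase(id) > 0;
}
```

Keying the map by paper ID makes lookup by paper number a single `find` call; retrieval by title or author would need either a scan over the map or a second index.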
  • 3.3 Algorithm description
    • First, write a crawler in Python and embed it in the C++ program; store the crawled data in a file, then read the file into a map container (which stores key-value pairs).
    • Use the map container (implemented as a red-black tree) to query and access the data, and build the word-frequency statistics and the search and filtering algorithms on top of it.
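The word-frequency step could look roughly like the sketch below, which counts words across paper titles in a `std::map` and returns the N most frequent ones. The function names and the small stop-word list are assumptions for illustration and are not taken from the original design.

```cpp
#include <algorithm>
#include <cctype>
#include <cstddef>
#include <map>
#include <set>
#include <string>
#include <utility>
#include <vector>

// Splits text into lower-case words made of alphabetic characters only.
std::vector<std::string> tokenize(const std::string& text) {
    std::vector<std::string> words;
    std::string current;
    for (unsigned char c : text) {
        if (std::isalpha(c)) {
            current += static_cast<char>(std::tolower(c));
        } else if (!current.empty()) {
            words.push_back(current);
            current.clear();
        }
    }
    if (!current.empty()) words.push_back(current);
    return words;
}

// Counts how often each word occurs across all titles,
// skipping a small (assumed) list of stop words.
std::map<std::string, int> count_words(const std::vector<std::string>& titles) {
    static const std::set<std::string> stop_words = {
        "a", "an", "and", "for", "in", "of", "on", "the", "to", "with"};
    std::map<std::string, int> counts;
    for (const auto& title : titles) {
        for (const auto& word : tokenize(title)) {
            if (stop_words.count(word) == 0) ++counts[word];
        }
    }
    return counts;
}

// Returns the n most frequent words, most frequent first;
// ties are broken alphabetically.
std::vector<std::pair<std::string, int>> top_hot_words(
        const std::map<std::string, int>& counts, std::size_t n) {
    std::vector<std::pair<std::string, int>> ranked(counts.begin(), counts.end());
    std::sort(ranked.begin(), ranked.end(),
              [](const std::pair<std::string, int>& a,
                 const std::pair<std::string, int>& b) {
                  if (a.second != b.second) return a.second > b.second;
                  return a.first < b.first;
              });
    if (ranked.size() > n) ranked.resize(n);
    return ranked;
}
```

Calling `top_hot_words(count_words(titles), 10)` would give the TOP10 list mentioned in the requirements. Each `++counts[word]` is a logarithmic-time operation on the map, so counting stays cheap at the target volume of about 9000 titles.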
  • 3.4 Interfaces with other modules
    • 1. Crawler interface
    • 2. Class interface
  • 3.5 Exception handling
    • 1. Failed to read the list of papers.
    • 2. Failed to crawl papers.
    • 3. Failed to read the crawled data.
    • 4. The title entered by the user is not in the list of papers.
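Two of the failure cases above (unreadable crawl data and a title that is not in the list) might be surfaced as exceptions, as in the following sketch. The `CrawlError` type, both function names and the one-record-per-line `id<TAB>title` file format are assumptions made for illustration; the original design does not specify them.

```cpp
#include <cstddef>
#include <istream>
#include <map>
#include <sstream>
#include <stdexcept>
#include <string>

// Hypothetical error type for all crawl-data failures.
struct CrawlError : std::runtime_error {
    using std::runtime_error::runtime_error;
};

// Reads crawl data in an assumed format: one paper per line, "id<TAB>title".
// Throws CrawlError if the stream is unreadable, a line is malformed,
// or no papers are found.
std::map<std::string, std::string> load_crawl_data(std::istream& in) {
    if (!in) throw CrawlError("failed to read crawl data");
    std::map<std::string, std::string> titles_by_id;
    std::string line;
    std::size_t line_no = 0;
    while (std::getline(in, line)) {
        ++line_no;
        if (line.empty()) continue;
        const std::size_t tab = line.find('\t');
        if (tab == std::string::npos || tab == 0 || tab + 1 == line.size()) {
            throw CrawlError("malformed record on line " + std::to_string(line_no));
        }
        titles_by_id[line.substr(0, tab)] = line.substr(tab + 1);
    }
    if (titles_by_id.empty()) throw CrawlError("crawl data contains no papers");
    return titles_by_id;
}

// Returns the ID of the paper with exactly this title,
// or throws CrawlError if the title is not in the list.
std::string find_id_by_title(const std::map<std::string, std::string>& titles_by_id,
                             const std::string& title) {
    for (const auto& entry : titles_by_id) {
        if (entry.second == title) return entry.first;
    }
    throw CrawlError("no paper with title: " + title);
}
```

The user interface can then catch `CrawlError` in one place and show its message, instead of checking return codes after every step.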
  • 3.6 Test considerations
    • Unit tests: crawler test, class data-reading test, word-frequency sorting test
    • Integration test: check whether the complete system produces the desired results.
    • Test tool: Visual Studio 2017

4. System integration and packaging

  • Package the system as a Windows form application with a graphical interface.

5. Document summary

  • Our product, “crawl music”, helps users obtain paper information in batches. Users need to get the information they want from a given list of papers, together with the corresponding retrieval and statistics functions, but existing products do not meet these needs well. We have our own method for fetching web information in batches, which makes it very convenient for users to find papers and shows research trends more clearly.


Problems encountered and solutions

  • Problem: acquiring the paper information
  • Solution: searched Baidu for how to embed a Python crawler in C++
  • Problem: hot-word statistics
  • Solution: reuse the word-frequency statistics from assignment 1
  • Problem: storage, processing and analysis of the information
  • Solution: use files to handle the data and build the corresponding data structures for statistical analysis.

PSP table

PSP2.1 | Personal Software Process Stages | Estimated time (minutes) | Actual time (minutes)
Planning | Plan | 90 | 120
· Estimate | · Estimate how much time this task will take | 90 | 120
Development | Development | 310 | 440
· Analysis | · Requirements analysis (including learning new technology) | 180 | 240
· Design Spec | · Generate design documents | 60 | 90
· Design Review | · Design review | 10 | 20
· Coding Standard | · Code specification (develop appropriate specifications for the current development) | 0 | 0
· Design | · Specific design | 60 | 90
· Coding | · Specific coding | 0 | 0
· Code Review | · Code review | 0 | 0
· Test | · Test (self-test, modify code, submit revisions) | 0 | 0
Reporting | Report | 25 | 40
· Test Report | · Test report | 0 | 0
· Size Measurement | · Measure the amount of work | 5 | 10
· Postmortem & Process Improvement Plan | · Summarize afterwards and propose a process improvement plan | 20 | 30
Total | | 425 | 600

  • When I first discussed the requirements with my classmate Le Zhonghao, neither of us had much idea how to implement them, so we learned from each other and kept discussing the objectives as we looked up information. After analyzing and discussing feasibility several times, we settled on the current implementation approach. Pairing is very useful: we can spot each other's shortcomings, push each other and improve together.
  • Below is an unflattering photo of us.
    [image]

Week N | New code (lines) | Cumulative code (lines) | Study hours this week | Cumulative study hours | Important growth
1 | 100 | 100 | 15 | 15 | Learned how to embed Python in C++ and how to design prototypes with Axure.

Link to this article: Software engineering practice, third assignment